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Preface 


The new particle physics of the past 30 years, including electroweak theory, 
quantum chromodynamics, grand unified theory, supersymmetry, supergravity 
and superstring theory, has greatly changed our view of what may have happened 
in the universe at temperatures greater than about 10'5 K (100 GeV). Various 
phase transitions may be expected to have occurred as gauge symmetries which 
were present at higher temperatures were spontaneously broken as the universe 
cooled. At these phase transitions topological defects, such as domain walls, 
cosmic strings and magnetic monopoles, may have been produced. Various 
types of relic particles are also expected. These may include neutrinos with 
small mass and axions associated with the solution of the strong CP problem 
in quantum chromodynamics. If supersymmetry exists, there should also be 
relic supersymmetric partners of particles, some of which could be dark matter 
candidates. If the supersymmetry is local (supergravity) these will include the 
gravitino, the spin-3 partner of the graviton. Insight may also be gained into 
the observed baryon number of the universe from mechanisms for baryogenesis 
which arise in the context of grand unified theory and electroweak theory. 
Supersymmetry and supergravity theories may have scope to provide the particle 
physics underlying the inflationary universe scenario that resolves such puzzles 
as the extreme homogeneity and flatness of the observed universe. Superstring 
theory also gives insight into the statistical thermodynamics of black holes. In 
the context of superstring theory, bold speculations have been made as to a period 
of evolution of the universe prior to the big bang ('pre-big-bang' and 'ekpyrotic 
universe' cosmology). 

These matters, amongst others, are the subject of this book. The book gives 
a flavour of the new cosmology that has developed from these recent advances 
in particle physics. The aim has been to discuss those aspects of cosmology that 
are most relevant to particle physics. From some of these it may be possible to 
uncover new particle physics that is not readily discernible elsewhere. This is a 
particularly timely enterprise, since, as has been noted by many authors, the recent 
data from WMAP and future data expected from Planck mean that cosmology 
may at last be regarded as precision science just as particle physics has been for 
many years. 


xi 


xii Preface 


We are grateful to our colleagues Nuno Antunes, Mar Bastero-Gil, Ed 
Copeland, Beatriz de Carlos, Mark Hindmarsh, George Kraniotis, Andrew Liddle, 
André Lukas and Paul Saffin for the particle and cosmological physics that we 
have learned from them. Special thanks also to Malcolm Fairbairn for helping 
us with the diagrams. Finally, we wish to thank our wives for their invaluable 
encouragement throughout the writing of this book. 

We intend to maintain an updated erratum page for the book at 
http://www.pact.cpes.sussex.ac.uk/~mpfg9/cosmobook.htm. 


David Bailin and Alexander Love 
June, 2004 


Chapter 1 


The standard model of cosmology 


1.1 Introduction 


The principal concern of this book is the way in which recent particle physics, 
including electroweak theory, quantum chromodynamics, grand unified theory, 
supersymmetry, supergravity and superstring theory, has changed our standpoint 
on the history of the universe when its temperature was greater than 10? K. This 
will be studied in the context of the Friedman-Robertson- Walker solution of the 
Einstein equations of general relativity. In this chapter, therefore, our first task 
is the derivation of the field equations relating the scale factor R(t) that appears 
in the metric to the energy density p and the pressure p that characterize the 
(assumed homogeneous and isotropic) energy-momentum tensor. This is done 
in the following two sections. In section 1.4 we show how, for a given equation 
of state, energy-momentum conservation determines the scale dependence of the 
energy density and pressure. The standard solutions for the time dependence of 
the scale factor in a radiation-dominated universe, in a matter-dominated universe, 
and in a cosmological constant-dominated universe are presented in section 1.5; 
we give an estimate of the age of the universe in the matter-dominated case in 
section 1.6. In section 1.7, we present the evidence that there is, in fact, a non- 
zero cosmological constant and discuss why its size is so difficult to explain. The 
discussion of phase transitions and of relics that is given in later chapters also 
requires a description of the thermodynamics of the universe. So in the following 
two sections we describe the equilibrium thermodynamics of the expanding 
universe and derive the time dependence of the temperature in the various epochs. 
In section 1.10, we discuss briefly the ‘recombination’ of protons and electrons 
that left the presently observed cosmic microwave background radiation. Finally, 
the synthesis of the light elements that commenced towards the end of the first 
three minutes is discussed in section 1.11. The consistency of the predicted 
abundances with those inferred from the measured abundances determines the 
so-called baryon asymmetry of the universe, whose origin is discussed at length 
in chapter 4. 


DOI: 10.1201/9780367806637-1 | 


2 The standard model of cosmology 
1.0 The Robertson-Walker metric 


The standard description of the hot big bang assumes a universe which is 
homogeneous and isotropic with a metric involving a single function R(t), 
the 'scale factor' (or 'radius' of the universe). The appropriate metric is the 
Robertson- Walker metric 


2 
1 — kr? 


ds? = di? — R2(t) ( +r? do? + r? sin? 0 a) (1.1) 


where the (time and spherical polar) coordinates (t, r, 0, $), called the ‘comoving’ 
coordinates, are the coordinates of an observer in free fall in the gravitational 
field of the universe. The parameter k takes the values —1,0, 1 corresponding 
to a universe which has spatial curvature which is negative, zero or positive, 
respectively. (This can be seen from the curvature scalar derived from the second 
equality of (1.30) with a change in sign for Euclidean rather than Minkowski 
space.) Units have been chosen in which the speed of light c is 1. 

An immediate use of this metric is to calculate the size of regions of the 
universe that have been in causal contact (in the sense that there has been the 
possibility of causal influence occurring between points within the region at some 
time between the big bang at ¢ = 0 and time t). Causal influences cannot occur 
over distances greater than the (proper) distance dy (t) that light has been able to 
travel from the the big bang at t = 0 to the time t being studied. This distance 
is called the ‘particle horizon’. Without loss of generality, consider emission of 
a light signal from coordinate (r, 0, $) at t = 0 to coordinate (0, 0, $) at time t 
along the (radial) geodesic with 0 and $ constant. (It may be checked that this is 
indeed a geodesic by using the coefficients of affine connection given in the next 
section (exercise 1).) For a light beam, ds? = 0 and we have 


dt? dr? 


RD 1—kr2’ (1.2) 


Thus, the largest value of r at t = 0 to be in causal contact with r = O at time f is 


given implicitly by 
t dt’ r dr’ 
—- = —————á. 1.3 
Í R(t’) Í 1 —kr (0 


This equation determines the particle horizon. The proper distance to the particle 
horizon at time f is 


r dr' 
du(t) = R(t ———L 
n (t) of 5 


! gr 
=R ——. 1.4 
of RG) (1.4) 
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We shall discuss the time dependence of the scale factor R(t) in the next section. 
Equation (1.4) then allows us to calculate the particle horizon. For example, when 


RQ) « (2? (1.5) 
as is the case for a matter-dominated universe, we get 

dg (t) = 3t (1.6) 
and for a radiation-dominated universe in which 

R(t) x (7? (1.7) 


we get 
dg (t) = 2t. (1.8) 


For an inflationary universe, such as will be discussed in chapter 7, 
R(t) xe"? (1.9) 


with H approximately constant, and then 
l 

dy(t) = — (e! — 1). 1.10 

H(t) H‘ ) (1.10) 


The Robertson—Walker metric also allows us to calculate the redshifting of 
light from distant objects. Consider light, travelling on a radial geodesic, being 
received at r = 0 at (around) the present time t = to from a distant galaxy at 
r = rj. Suppose that two adjacent crests of a light wave are received at t = to 
and; = tọ + Atọ having been emitted from the distant galaxy at t = fj and 
t = t + At). Equation (1.3) applies but with appropriate modifications to the 
limits of integration. Thus, 


[ sf x (1) 
t R(t) 0 ] —kr 
ane fot Ato qr "n dr 
f —— = Í ———. (1.12) 
nan R(t) 0 l—kr 
Subtracting gives 
[emu (1.13) 
tj Ah R(t) ti R(t) 
so that 


tot Ato dt n +An dt 
Í — = Í —. (1.14) 
e RO J, R(t) 


4 The standard model of cosmology 
Because the variation of R(t) on the time scale of an electromagnetic wave period 
is very small, this equation may be approximated by 
Ato at 
R(t) R) 


But Ato and Af are the times between adjacent crests; in other words, they are 
the periods of the waves. Thus, the waves have frequencies 


(1.15) 


vo = — and v = — (1.16) 


respectively and, in units where c = 1, wavelengths 
ào = Ato and Ay = An (1.17) 

respectively. The redshift is usually defined by 
Ao — Ai 

za S 


1.18 
" (1.18) 
and, from (1.15), we conclude that 
R(to) 
I +z = —. 1.19 
R(t) (1.19) 


Equations (1.19) and (1.17), reinterpreted in terms of photons, mean that a photon 
emitted at time t undergoes a redshifting of its wavelength as the universe 
expands, such that its wavelength at time fo is increased by a factor R(to)/ R(t). 
Since the momentum (or energy) of the photon is inversely proportional to its 
wavelength, the momentum (or energy) of the photon is reduced by a factor 
R(t))/R(to) as a result of the expansion of the universe. This is often expressed 
as energy of photons being redshifted away. 
When |f; — tol is not too large, we can make the expansion 


R(t) = RCo) + (1 — to) R(to) + $C — to)? R(to) + --- 
= R(to)(1 + Holt — to) — 3qoHg(ti — t)? +) (1.20) 
where 
Ho = —— (1.21) 
is the present value of the Hubble parameter and qo is the present deceleration 
parameter . . 
. | R(t)  R(o)R(t) 
R(to) Hj R(to)? 
The redshift may also be expanded in powers of tı — to: 


(1.22) 


1+z=(1 + Holti — to) — $go Hg (ti — to)? 9. )7! (1.23) 
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leading to 
z= Hot - 1) + (i+ 9) nto - ny +e. (1.24) 


Since z is the physically measurable quantity, it is useful to invert (1.24). For 


small z 
| 1 
— f = — 1 1.25 
to —ti zl- ~ (14500) 2+--]. (1.25) 


Then, after expanding 1/ R(t) in (1.11) in powers of t — to, we may determine rj 
as a function of z. Expanding (1.11) gives 


1 
R zu; |o -n)t- ; Hoo =n)? + | =rit+ O(r}). (1.26) 


Thus, in terms of the redshift, 


l l 2 
e]. l. 
r= R (to) Ho É 50 + 402 t | (1.27) 


We shall use this result in section 1.7 to calculate the ‘luminosity distance’ of a 


(supernova) source as a function of the redshift. 


1.3 Einstein equations for a Friedmann-Robertson-Walker 
universe 


It is straightforward to calculate the coefficients of affine connection for the metric 
(1.1). The non-zero components are 


R R 
ry = Du jo = ou = To; (1.28) 
ry = 58 "(81 + Oj 81k — 918 ji). (1.29) 


Here x’, i = 1, 2,3, denotes the (spatial) coordinates (r, 0, ¢). Equation (1.29) 
is just the coefficients of affine connection for the three-dimensional subspace 
(7,6, 9). It is also straightforward to calculate the Ricci tensor R,, from the 
cofficients of affine connection (exercise 2). It has non-zero components 


R R OR? 2k 
Ro = -3% and Rij = diem + Jj (1.30) 


. RR ok 
R = g^ Rw =-6| 545+ Bl (1.31) 


6 The standard model of cosmology 


The Einstein equations for the Robertson-Walker metric, usually referred to 
as the Friedman-Robertson- Walker (FRW) universe, are 


Rav — 3Rgy, = 81GNTus + Aga (1.32) 


where Gy is the Newtonian gravitational constant, 7,,, is the energy-momentum 
tensor and we are including a cosmological constant A. For a perfect fluid with 
energy density p and pressure p, the non-vanishing components are 


Too — p and Tij = —póij. (1.33) 


The corresponding Einstein equations are, from the 00-component, 


R k — 81Gy. A 
| 45 = += 34 
(2) + 3 ^*3 (134) 


usually referred to as the ‘Friedmann’ equation, and, from the i j-components, 


2 


"LM R +4 =—-8xGnpt+A (1.35 
R R Rm NP . ` ) 


Subtracting (1.35) from (1.33) gives the equation for R 


Ë ^— 4nGN 


A 
R 3 (p + 3p) t. (1.36) 


In the case A = O, this equation implies that R < O for all times’. Then, the 
present positive implies that R was always positive and, therefore, that R was 
always increasing. Consequently, ignoring the effects of quantum gravity, there 
was a past time when R was zero—the moment of the ‘big bang’. 

Returning to the Friedmann equation (1.34) with zero cosmological constant, 
the universe is spatially flat when 


3H? 
P= Pe= Ra. 3M2 H? (1.37) 
where H is the Hubble parameter, 
R 
H= r (1.38) 
and Mp is the reduced Planck mass given by 
2 l mp 


Lf (1.39) 


! A positive value of the acceleration Ë can only arise if A is positive. 
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where m p is the Planck mass, and 
Mp = 2.44 x 1015 GeV mp = 1.22 x 10? GeV. (1.40) 


Since the Hubble parameter varies with time, so does p,. The density parameter 


$2 is defined as 


Qa (1.41) 


Pe 


and measures the density as a fraction of the ‘critical’ density pe. The current 
value of Q, denoted by £20, has a value [1] 


Qo = 1.02 + 0.02. (1.42) 


1.4 Scale factor dependence of the energy density 
There is also conservation of the energy-momentum tensor to take into account: 
D,T"" =0 (1.43) 


where 
D,V* =a,V" + rv? (1.44) 


is the action of the covariant derivative D; on a contravariant index. The 4 = 0 
component of (1.43) yields (exercise 3) 


. R 
+30 + py =O. (1.45) 
It is easy to see that this is just the first law of thermodynamics 
dE + pdvV=0 (1.46) 


for a comoving volume V « R3(t). 
The energy density o may be related to the scale factor R(t) once we have 
the equation of state. If this is of the form 


p= up (1.47) 


then (1.45) leads to 
p x Rt (1.48) 


In particular, for w = m corresponding to radiation (massless matter) 
pn R$ radiation p= i p- (1.49) 
For w = 0, corresponding to massive matter, 


px RO matter p=0. (1.50) 


8 The standard model of cosmology 


Equation (1.50) may be understood as a constant number of massive particles 
occupying a volume expanding as R? (t) as the universe expands. Equation (1.49) 
may be understood as the number density of photons (or other massless particles) 
decreasing as R7? (t), as for massive matter but, in addition, the energy of each 
photon decreasing as R~'(t) because of the redshifting of the photon energy 
discussed in section 1.2. Another interesting case is w = — 1, which gives 


p = constant p^-p. (1.51) 


This may be interpreted as vacuum energy and allows us to incorporate the 
cosmological constant into the discussion without introducing it explicitly, if we 
wish. 


1.5 Time dependence of the scale factor 


It is easy to solve the Friedmann equation (1.34) in the case of zero cosmological 
constant and k = 0, a spatially flat universe. Both of these assumptions are 
always good approximations for sufficiently early times because, as discussed 
in section 1.4, p x R~‘* for radiation domination and p œ R`? for matter 
domination. Consequently, for a 'big-bang' universe with R — Oast — 0, 
the $xGup term in (1.34) becomes more important than the k/R? or A/3 
terms. With the energy density p given by (1.48), the solution of (1.34) (provided 
w Æ —1)is 


R(t) x 173099, (1.52) 
In particular, 
R x (172 and H= NE for radiation domination (1.53) 
and 
R œt’ and H= $C! for matter domination. (1.54) 


However, if at some stage in the history of the universe the cosmological constant 
is (positive and) large enough to dominate over the energy density and curvature 
terms in (1.34), then the Friedmann equation has the solution 


R(t) x EB r- (1.55) 


This is the de Sitter universe. 


1.6 Age of the universe 


We shall estimate the age of the universe in the case A = 0. We shall also 
assume a matter-dominated universe for the calculation. This is a reasonable 
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approximation because, as can be seen from section 1.8, the universe was matter- 
dominated for most of its history. First, rewrite the Friedmann equation (1.34) in 
terms of the value pọ of the energy density p today. From (1.50), 


R -3 
Z = (x) (1.56) 


Thus, the Friedmann equation may be written as 


2 


R k 8rGn Ro 
=)+ 55 zo 1.57 
(zs) "RO MOR (97) 


Next rewrite this in terms of the present value Qo of the density parameter (1.41): 


po 
Ro = ———————. 1.58 
97 (3/81Gn)H? (1.38) 
Then, at t = 19, (1.57) gives 
k | 81GN 
= = —— 0 - H Hi(Qe- 1) (1.59) 
R2 3 


where the last equality employs (1.58). Thus, the Friedmann equation may be 


written as 2 


R R 

— | + H2(Q0- 1) = RZ., (1.60) 

Ro R 
This may be rewritten in terms of the variable 

R 
= — 1.61 
x Ro (1.61) 

as 

i) Hà(Qo — 1) = Qo Hx! (1.62) 
with solution uu" ae’ 

TAS Í I (1.63) 

Ho Jo Jox! 1) +1 


In particular, today, when R = Ro, x has the value | and the current age of the 
universe is 


to (1.64) 


an 
~ Ho Jo S&a -D1 


We see that to ~ hh with the precise value depending on the value of $29. For 
example, for an exactly flat universe (which is not consistent with observations) 
Qo = 1 and to = 2 Hg |. It is usual to write Ho ' in the form 


Hg! = h719.78 x 10° yr (1.65) 
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where the parameter h is measured to have the value 

h = 0.72 + 0.05. (1.66) 
Thus, the present age of the universe is 


to ~ 1019 yr, (1.67) 


1.7 The cosmological constant 


In 1917, attempting to apply his general theory of relativity (GR) to cosmology, 
Einstein sought a static solution of the field equations for a universe filled with 
dust of constant density and zero pressure. The general static solution of (1.34) 


and 


=> = =. 1.69 

Ri 3 ^*3 (1.69) 
With zero cosmological constant (A = 0), the only solution of these equations, 
apart from an empty, flat universe, requires that either the energy density p or the 
pressure p is negative. It was this unphysical result that led him to introduce the 
cosmological term. Then the solution for pressureless dust is 


A 


= —— 1.70 
P= Gn (1.70) 
and 
k 
rn A. (1.71) 
Assuming that p is positive requires that A is positive, so that 
k=+1 (1.72) 
and | 
R= (1.73) 


VA 
Hence, the universe is closed and has the geometry of 5? with volume V and mass 
M given by 


V= RP =A M= (1.74) 


x 
2GN A. 
A non-zero cosmological constant also allows non-trivial static (de Sitter) 
solutions of the Einstein field equations with no matter (9 = 0 = p)atall. It 
was, therefore, a considerable relief in the 1920s when the redshifts of distant 
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galaxies were observed, the presumption of a static universe could be abandoned 
and there was no need for a cosmological constant. 

However, anything that contributes to the energy density of the vacuum (p) 
acts just like a cosmological constant. This is because the Lorentz invariance 
of the vacuum requires that the energy-momentum tensor in the vacuum (Tuy) 
satisfies 

(Tav) = (0)8uv- (1.75) 


Then, by inspection of (1.32), we see that the vacuum energy density contributes 
8x Gy (p) to the effective cosmological constant 


Aeg = A 4-81 Gy (p). (1.76) 


Equivalently, we may regard the cosmological constant as contributing A/82 Gn 
to the effective vacuum energy density 


Pac = (P) + TET = New M3. (1.77) 
Thus, a cosmological constant is often referred to as ‘dark energy’, not to be 
confused with dark matter which contributes to the non-vacuum energy density 
(and has zero pressure). 

A priori, in any quantum theory of gravitation, we should expect the scale 
of the vacuum energy density to be set by the Planck scale Mp. Since A has 
the dimensions of M, it follows that we should have expected that A/ MÀ ~]. 
We shall see that, in reality, the scale of any such energy density must be much 
smaller. We noted in section 1.5 that the effect of the cosmological constant is 
negligible at sufficiently early times, because the energy density p scales as a 
negative power of R for radiation or matter domination. Thus, the most stringent 
bounds arise from cosmology when the expansion of the universe has diluted the 
matter energy density sufficiently. From the observation that the present universe 
is of at least of size Ay , we may conclude that 


|Aer| € 3H (1.78) 
where 
Hy | ~ 10'° yr ~ 10% Gev! (1.79) 
from (1.67). Then, in Planck units, 
^ 
Rett < 107120, (1.80) 
Mp 


For many years, this tiny ratio was taken as evidence that the cosmological 
constant is indeed zero. However, during the past few years, evidence has 
accumulated that A is, in fact, non-zero. 
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The first evidence suggesting this came from measurements of the redshifts 
of type Ia supernovae. Such supernovae arise as remnants of the explosion of 
white dwarfs which accrete matter from neighbouring stars. Eventually the white 
dwarf mass exceeds the Chandrasekhar limit and the supernova is born after the 
explosion. The intrinsic luminosity of such supernovae is considered to be a 
constant. That is, they are taken as standard candles and any variation in their 
apparent luminosity as measured on earth must be explicable in terms of their 
differing distances from the earth. In a Euclidean space, the apparent luminosity / 
of a source with intrinsic luminosity L at a distance D from the observer is given 
by 

|. L 

T 47 D2" 
We may, therefore, define the ‘luminosity distance’ Dg of a source from the 
observer by 


(1.81) 


Di; 2j. (1.82) 


In GR we must be more careful. So consider the circular mirror, area A, of a 
telescope at the origin, normal to the line of sight to a source at r1. Light emitted 
from the source at time fı and arriving at the mirror at time tọ is bounded by a 


cone with solid angle 
A 


w = ——F 
4n R(to)*r? 
as measured in the locally inertial frame at the source. The emitted photons have 
their energy redshifted by a factor 
Rn) 1 
Ro) 1 +z 


(1.83) 


(1.84) 


as explained in section 1.2, (see (1.18)). Also, photons emitted at time intervals of 
ôtı reach the mirror at time intervals ŝto = ôtı R(to)/ R(t). Thus, the total power 
P received at the mirror is given by 


(Rwy 


and the apparent luminosity by 
P 
l=-. . 
A (1.86) 


Then, using (1.27), the luminosity distance defined in (1.82) is 
- | 
p, =H toz- 50 ane. (1.87) 


ee Peres pee eee 
=a [e+ 30 qo)z + | (1.88) 
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Figure 1.1. Hubble diagram giving the effective magnitude versus redshift for the 
supernovae in the primary low-extinction subset. The full line is the best-fit flat-universe 
cosmology from the low-extinction subset, the broken and dotted lines represent the 
indicated cosmologies. 


Hence, for nearby supernovae the luminosity distance is proportional to the 
redshift of the source. 

Astronomers measure the apparent magnitude m of the various supernovae 
sources. The difference m — M, where M ~ —19.5, is the (assumed constant) 
intrinsic magnitude of the source, is just the logarithm of the luminosity distance. 
So the apparent magnitude is predicted to be linear in Inz for small z. This is 
consistent with the data for z € 0.1, see figure 1.1 taken from [2]. For more distant 
supernovae the linear relationship between D, and z is distorted by quadratic 
terms depending on the present deceleration parameter qo of the universe. The 
data for 0.7 € z € 1 do display such a distortion, see figure 1.1 [2]. 

For an FRW universe, it follows from (1.36) and the definition (1.22) of qo 
that, in general, the deceleration may be written as 


qo = 39 0 + 3w)Q; (1.89) 


for a universe with components labelled by i having energy density p; and 
pressure p; = uwjpj; here Qj = pj/p. where pe = 3H2/81 GN is the 
critical density. In particular, for a universe with just (pressureless) matter and 
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Figure 1.2. 68%, 90%, 95%, and 99% confidence regions for Qm and Qa. 


a cosmological constant, we get 
do = 52m ~ Q4 (1.90) 


where Rm = Pm/ pc is the matter contribution and 2a = Pvac/Pc = Acr/3 Hg. 
As noted previously, a negative value of go, corresponding to an accelerating 
universe, can only arise with a positive cosmological constant. The data shown in 
figures 1.1 and 1.2 taken from [2] suggest that this is indeed the case. 

The determination of Qm and $24 requires at least one further input. The 
recent data on the temperature anisotropies of the cosmic microwave background 
provide just such a constraint. Photons originating at the ‘last scattering surface’, 
when matter and radiation decouple (see section 1.10), having a redshift z ~ 
1300, are seen now as the microwave background. Quantum fluctuations in 
the early universe give rise to fluctuations in the energy density of the radiation 
and these appear as temperature fluctuations in the microwave background (see 
section 7.7). These fluctuations may be analyzed by multipole moments, labelled 
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by l, and are characterized by their power spectrum. The multipole number [peak 
of the first peak in the power spectrum is determined by the total matter content of 
the universe. In fact, lpeak ~ 220829, where 29 = o/c measures the total energy 
density po relative to the critical density. The measured position of the first peak 
yields the value (1.42). Thus, for a universe with just matter and a cosmological 
constant, we get 

Qm o Q4 ~l. (1.91) 


When this resuit is combined with the supernova and other data, it is found that 
Rm ~ 0.3 Qa ~ 0.7. (1.92) 


In Planck units, this means that 


A 
ef Pe Qa x 0.8 x 10717, (1.93) 
My Mp Mp 


There is currently no known explanation of this extremely small number. It 
corresponds to od ~ 107? eV. It is generally believed that the particle physics 
vacuum is the minimum of an effective potential in which the electroweak gauge 
symmetry SU (2); x U(l)y is spontaneously broken (see section 2.5). The value 
of the effective potential at this minimum (p) has no effect on the particle physics. 
By adding a constant Vo to the tree-level potential (2.93), it is easy to arrange that 
the potential, including any radiative and temperature-dependent corrections, has 
any desired value at the minimum. However, to do so requires the fine tuning of Vo 
to ensure that the value (1.93) is obtained and it is this fine tuning that is regarded 
as unnatural and for which an explanation is sought. The obvious first approach 
to the problem is to seek a symmetry that requires A = 0 and then to explore 
mechanisms that break the symmetry only slightly. The only known symmetry 
that requires a vanishing cosmological constant is global supersymmetry. The 
(fermionic) supersmmetry generator Q satisfies the anticommutation relation 


(Q, Q} = 2y" P, (1.94) 
where P, is the energy-momentum vector. It follows [3] that, for any state |Y), 
(V|Polv) = (V1Qo Q5 + Q2 Qa ly) > 0. (1.95) 


Thus, the energy of any non-vacuum state is positive and the vanishing of the 
vacuum energy defines a unique, supersymmetric vacuum state |0) that satisfies 


(0| P9]0) = 0 & Q,|0) = 0. (1.96) 


In a supersymmetric theory, all particles have supersymmetric partners (called 
‘sparticles’) having opposite statistics. That is to say, the sparticle associated with 
a fermion is a boson and the sparticle associated with a boson is a fermion. The 
sparticles associated with the quarks and leptons, called respectively 'squarks' 
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and ‘sleptons’, are (spin-0) scalar particles and, in a supersymmetric theory, they 
must have the same mass and quantum numbers as the original particles. This 
has the important consequence that the vanishing cosmological constant result is 
unaffected by quantum effects, because supersymmetry ensures that any quantum 
corrections arising from fermion loops, say, are cancelled by those that arise 
from the bosonic loops of the associated sparticle. It has yet to be demonstrated 
experimentally that supersymmetry has anything to do with reality. None of the 
sparticles associated with the known particles has ever be seen. (It is hoped that 
they will be discovered at the Large Hadron Collider (LHC).) Supersymmetry 
(susy), if present at all, is therefore a broken symmetry. It then follows from 
(1.95) that the vacuum energy is positive definite. The experimental limits on the 
sparticle masses require that 


Msusy Z 100 GeV. (1.97) 


If something like this bound were to set the scale for pysc, then 


Dvac —68 
—~ 10°". (1.98) 
M$ 


Although small compared with the O(1) expected in a generic quantum theory 
of gravity, this is still very much larger than the value (1.93) derived from the 
supernovae and Wilkinson Microwave Anisotropy Probe (WMAP) data. Thus, 
if this were the only contribution to the vacuum energy density, we should be 
confronted with an unmitigated disaster. 

However, including gravity in any supersymmetric theory inevitably leads 
to a supergravity theory, in which supersymmetry is a local, rather than a global, 
symmetry. This is because in GR the momentum generator P, becomes a local 
field generating diffeomorphisms of spacetime. Then, in a supersymmetric theory 
incorporating GR, the supersymmetry generators too become local fields: this is 
why supergravity emerges as the low-energy limit of string theory. The form 
of the potential in a supergravity theory is given in section 2.8. The main 
point to note is that, as in the case of global supersymmetry, supersymmetric 
vacua are generally stationary points of this potential but that at such points the 
vacuum energy density is now generally negative. Non-supersymmetric (scalar) 
field configurations in which the energy density is zero do exist but (without 
fine tuning) these are not generally stationary points of the potential. Thus, 
supergravity does not solve the cosmological constant problem but it is no worse 
than in non-supersymmetric theories. 

In the absence of any theoretical insight into the origin of the smallness of the 
cosmological constant, it is of interest to see whether ‘anthropic’ considerations 
can shed any light on the issue. Using the ‘weak anthropic principle’, we seek 
to determine which era or which part of the universe could support human life, 
so that physicists exist to pose such questions. A large positive cosmological 
constant Jeads to an exponentially expanding (de Sitter) universe, see (1.55). 
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This exponential expansion inhibits the formation of the gravitationally bound 
clumps of matter that are presumably a necessary precondition for life to evolve; 
once the clumps are formed, the cosmological constant has no further effect. 
Thus, the weak anthropic principle requires Aeg to be small enough to allow 
the formation of sufficiently large clumps of matter. Gravitational condensation 
began in our universe at a redshift ze where ze > 4. The energy density of 
matter at that time was greater than the present matter density o,, by a factor 
of R3(to)/ RÌ (t-) = (1 +z)? > 125. The cosmological constant has no effect so 
long as it is dominated by the matter density. Thus, provided pyac S 12555, the 
vacuum energy density would not inhibit gravitational condensation. (A more 
careful treatment [4] gives a further factor of in?) We conclude that if the 
anthropic principle accounts for the value of the (positive) cosmological constant, 
then we should expect Pyac ~ (10 — 100)p,, because there is no anthropic 
reason for it to be smaller. This gives the prediction Qa ~ (10 — 100)Q,, at 
variance with the values (1.92) derived from the supernovae and WMAP data. 
Nevertheless, it implies a much smaller value Pyac /M$ than that given in (1.98) 
which was derived from supersymmetry considerations. 

In contrast, a negative cosmological constant does not affect gravitational 
clumping. We see from the Friedmann equation (1.34) that if A is negative, the 
expansion of the universe ceases (for a flat universe (k = 0)) when the matter 
density term is cancelled by the cosmological constant. We have already noted 
that the deceleration parameter qo given in (1.90) is positive for A « O. It follows 
that after expansion has ceased, the universe begins to contract and, in fact, it 
collapses to a singularity in a finite time T. It is easy to show (exercise 4) that 


2x 
JBiAT 
Anthropic considerations would then require that this leaves sufficient time for 


life to evolve, say T > 5H, where Hy = 4/3/8z G N Pm is the Hubble time in 
our universe. This would give 


T= 


(1.99) 


2 
aA < (F) (1.100) 
m 


Again, this would entail a much smaller value of Prac/ M$, than was obtained 
from supersymmetry considerations. However, the supernovae data indicate a 
universal acceleration rather than a deceleration. Thus, A is positive and the 
previous bound is only of academic interest. 


1.8 Equilibrium thermodynamics in the expanding universe 


It makes sense to discuss equilibrium thermodynamics during most of the history 
of the universe because reaction rates were much faster than the time scale for 
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the expansion of the universe which is characterized by the Hubble time H~!. 
As discussed in section 2.2, the pressure p, entropy density s and energy density 
p due to a gas of ultrarelativistic particles (in which the temperature T is much 
greater than all masses) are given by 


SINT (1.101) 

p= 90 * . 

Qn? 

= NT? 1.102 
$ 4S x ( ) 
ELT (1.103) 

P730 * ) 

where 

N, = Ng + $NF. (1.104) 


The numbers Ng and Nr of bosonic and fermionic degrees of freedom are defined 
after (2.19). The entropy S in a comoving volume R(t) 


S=sR3 (1.105) 


is expected to be conserved because a homogeneous universe has no temperature 
differences to generate heat transfer. (For an explicit proof of entropy 
conservation, see section 3.4 of Kolb and Turner or section 15.6 of Weinberg in 
the general references.) Thus, to the extent that the entropy density is dominated 
by the ultra-relativistic particles 


RT = constant (1.106) 


while M, is constant. Equation (1.106) is valid even for a matter-dominated 
universe because it is only the particles with mass m smaller than the temperature 
T that are present in thermal equilibrium with appreciable number densities and 
contributing to the entropy, although all particles contribute to the energy density. 
In reality, RT will show small discontinuous changes as the temperature drops 
below the mass of particular particle species. Subject to this caveat, equation 
(1.53) for the time dependence of the scale factor now implies the following 
connection between temperature and time for a radiation-dominated universe: 


T «xt7'/2 for radiation domination. (1.107) 


The constant of proportionality in this equation may be calculated from the 
Friedmann equation. When RT is a constant, 


D-O um 


and, using (1.103), the Friedmann equation (1.34) may be rewritten as 


Transition from radiation to matter domination 19 


. 2 2 
T Sx Gy x 4 
L| = NT 1.109 
(=) 3 30° (1.109) 


where we have neglected the cosmological constant and the curvature term, as in 
section 1.5. This has solution 


1/ 90 4,72 
17 5 (oz) MpPT? (1.110) 
* 
~ LSIMPN, ÊT. (1.111) 


If, for example, the appropriate N, for T above 100 GeV is that of the SU(3) x 
SU(2) x U(1) standard model or that of the supersymmetric standard model, 
then 

N=% o 3B (1.112) 


respectively. 
Equations (1.54) and (1.106) imply the following connection between 
temperature and time for the matter-dominated, universe: 


-2/3 


Tat for matter domination. (1.113) 
For a matter-dominated universe, 
T 3 
p(T) = 3M; HiQo (.) (1.114) 
0 


where we have used (1.56), (1.106), (1.58) and (1.40). Using (1.108), the 
Friedmann equation (1.34) may be rewritten as 


~.2 3 
T 2 T 
=] = — 1.11 
(7) nios (7.) (H1) 
with solution T 
2 1/2 ~1 T ) 
= -(Hı — . 1.11 
t 3 ( A ) To ( 6) 


1.9 Transition from radiation to matter domination 


As we have seen in (1.49) and (1.50), the energy density of radiation decreases as 
R~‘ as the universe expands whereas the energy density of matter decreases as 
R^. Thus, radiation domination gives way to matter domination at some point 
in the expansion of the universe. For a matter-dominated universe, the energy 
density is given by (1.114) and for a radiation-dominated universe by (1.103). 
However, there is a subtlety in the interpretation of N, which must be taken into 
account. We shall assume that the transition temperature is sufficiently low that 
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the only relativistic particles are the photon and three neutrinos. Neutrinos drop 
out of thermal equilibrium below about 1 MeV when the (weak) interaction rate 
that keeps them in thermal equilibrium becomes less than the Hubble parameter. 
(See section 5.2.) When the temperature drops below the electron mass (about 
0.5 MeV), electrons and positrons annihilate via e*e^ — yy, and the entropy of 
the electron-positron pairs is transferred to the photons. However, no entropy is 
transferred to the neutrinos, which are now decoupled. Before electron-positron 
annihilation, we have 
N.=2+3= 4 (1.117) 
but afterwards we should take 
N, =2. (1.118) 


Note that we are not keeping any contribution from the neutrinos because they 
have now dropped out of thermal equilibrium and no longer contribute to the 
entropy. If Ty; and Tyf are the photon temperatures before and after electron- 
positron annihilation, conservation of entropy requires that 


llr3 _ 473 
HT3 = 273, (1.119) 


T. 114,12 
w -() — (1.120) 
yi 


However, the neutrinos do not share in this temperature increase. Thus, there is 
an effective N, for the purpose of calculating the energy density of radiation 


so that 


Noe = 2+ 4x 6x Q1 ~ 3.36. (1.121) 


Note that the relativistic neutrinos still have an energy density that varies as R~4 
as the universe expands, because their number density varies as R^? and they 
undergo redshifting of their energy as R~'. Thus, the neutrino energy density 
also varies as T^, where T is the photon temperature, i.e. the temperature in the 
usual sense. Then, from (1.103), in the radiation-dominated universe below the 
temperature at which e*e^ annihilation occurs, 


n? 
30 


If the matter energy density of (1.114) and the radiation energy density 
(1.103) are equal at a temperature Teq, we find that 


p(T) = — N,enT^. (1.122) 


90M? HN 
Tey = PS. (1.123) 
UT Naeff To 
Using (1.65), (1.121) and (1.40) gives 


Teq = 5.682oh? eV. (1.124) 
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With A given by (1.66) and Qo by (1.42), we have 
Teq = 3 eV. (1.125) 


This justifies the original assumption that the only relativistic particles are the 
photon and three neutrinos. 


1.10 Cosmic microwave background radiation (CMBR) 


During the radiation-dominated era, the photons were in thermal equilibrium 
with matter (at the same temperature) because of interaction with the charge 
of the electrons and protons. We are making the approximation here that all 
baryons in the universe at this time are in the form of protons. However, 
eventually the electrons and protons combine into neutral atoms. (This is referred 
to as ‘recombination’.) Thereafter, photons decouple from matter and evolve 
at a temperature different from matter. In this way, black-body radiation at 
the recombination temperature develops into black-body radiation in the present 
universe at a lower temperature, because the temperature is proportional to the 
mean photon energy and the energy of the photons has redshifted with the 
expansion of the universe. Thus, for the photons, 


T ~ REA. (1.126) 
The recombination temperature Tec may be estimated to be 
Tree = 3575 K = 0.31 eV. (1.127) 


Here, recombination has been defined as the point at which 90% of electrons 
have combined with protons. (See, for example, section 3.5 of Kolb and Turner 
in the general references.) We may calculate the time of recombination trec from 
(1.116), noting that the universe has been matter dominated from the time of 
recombination until the present by comparing (1.127) with (1.125). Using (1.42), 
(1.65) and (1.66), with 7p = 2.73 K, gives 


trec ~ 1.89 x 10° yr. (1.128) 


1.41 Big-bang nucleosynthesis 


In chapter 4 we shall discuss possible explanations of the ‘observed baryon 
asymmetry of the universe’: 


n= Æ ~6.4x 107! (1.129) 
n 


x 
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where ng and ny are the present number densities of baryons and photons. 
Before doing this, it is important to understand the origin of this number whose 
explanation has been and remains a major topic of research. For this reason we 
shall outline here how this number emerges from measurements of the present 
abundances of the light elements, specifically deuterium (D), helium (?He and 
4He) and lithium (Li), Light elements such as these and also tritium (T or 3H) 
and beryllium (7Be) were formed in a primordial nuclear reactor. We shall see 
that the process begins towards the end of the ‘first three minutes’, as the era was 
so memorably described by Weinberg [5]. The first step is the formation of the 
A = 2 nucleus deuterium via the process 


np > Dy (1.130) 


conventionally written by nuclear physicists as p(n, y)D. At earlier times, the 
process goes in both directions. However, since there are more than 10° photons 
for every nucleon in the universe at that time, any newly formed deuterium is 
dissociated before it gets a chance to capture a neutron or proton and begin 
building heavier nuclei. Thus, no appreciable deuterium density accumulates. 
This ‘deuterium bottleneck’ persists until there are too few sufficiently energetic 
photons to dissociate the deuterons before they can capture nucleons. The A = 3 
nuclei He and 3H are then formed via 


D(py)'He:  pD- Hey (1.131) 
D(D,n)*He: | DD Hen (1.132) 
3He(n, p 3H: n?He > 3Hp (1.133) 
and 4He via 
T(D, n) ^He : DT > ‘Hen (1.134) 
3He(D, p)*He: | D?He-» ‘Hep. (1.135) 


Since there are no stable A = 5 nuclei, the synthesis of heavier nuclei requires 
the *He nuclei to interact with D, 3H or 3He, all of which are positively charged. 
The Coulomb repulsion suppresses the reaction rates for such processes, thereby 
ensuring that virtually all of the neutrons available for primordial nucleosynthesis 
wind up in *He, the most tightly bound of the light nuclei. Subsequently, the 
processes T(*He, y) "Li, 7Li(p, *He) tHe, ?He(*He, y) "Be and 7Be(n, p) "Li 
form more 4He and also small amounts of lithium and beryllium. 

The first process p(n, y)D is crucial, since an appreciable deuterium 
abundance must be built up before the others can proceed; the neutron and proton 
number densities are too low to allow the build-up of the other nuclear abundances 
by direct many-body processes. Clearly, the original abundance of neutrons and 
protons determines the light element abundances generated by these primordial 
processes. However, light elements are also created and destroyed in stars, 
supernovae and other astrophysical phenomena. Consequently, the light element 
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abundances measured today differ significantly from those created in the first 
three minutes. Because of this, the primordial abundances can only be inferred 
from the observational data after corrections to allow for the effects of galactic 
chemical evolution. The discussion of these is beyond the scope of this book. 
Here we shall instead focus on the essential physics of the primordial processes. 
Following Bernstein et al [6] and Sarkar [7], we shall present a semi-analytical 
treatment that allows the *He abundance to be calculated quite accurately. The 
precise calculation of this and the other yields and, hence, of the present baryon 
asymmetry requires a detailed numerical analysis which also wil! not be presented 
here. 

At sufficiently high temperatures (above a few MeV) neutrons and protons 
are in kinetic and chemical equilibrium with, as follows from (1.129), a very 
high value (O(10!!)) of the entropy per nucleon. During this era the equilibrium 
nuclear abundances are quite negligible. The first stage of nucleosynthesis is the 
freeze-out of the weak interaction processes 

nv. e pe^ net © pře n € pe ve (1.136) 
that previously kept neutrons and protons in equilibrium. Kinetic equilibrium 
requires equality of the temperatures of the particles: 


Tn = Tp = T, T, - T (1.137) 


and chemical equilibrium requires that the chemical potentials of the various 
species satisfy 


Un — Lp = He- — By, = Hig — Het: (1.138) 


The total rate A45 for converting neutrons to protons via these processes is the 
sum of the individual rates: 


Anp = A(nve — pe^) + A(ne* — pie) + A(n > pe^ ve) (1.139) 


and the total rate À pn for the reactions that convert protons to neutrons is given by 
detailed balance 


Apn = Ànpe 27/T where Am = ma — mp = 1.293 MeV. — (1.140) 


(The difference arises because of the slightly different Boltzmann factors in the 
neutron and proton equilibrium densities, see (4.19).) Let us denote the fractional 
relative neutron abundance by X, = n,/ny, where n, is the neutron number 
density and ny is the total nucleon number density ny = na + np where np 
is the proton number density. Then the fractional relative proton abundance is 
Xp = Np/nn = | — Xn. The evolution of X, is determined by the balance 
equation 


Xn = Apa (1 — Xn) — AnpXn- (1.141) 
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The equilibrium solution, found by setting X, = 0, is 


Ap) | 1 


edo — 
Xn (0 = A(t) 1+ eda/TO 


where A = Apa Amp. — (1.142) 


Thus, we may rewrite (1.141) as 
Xn = —A(X, — Xp). (1.143) 


This shows that X, is always between its initial value and X5. At early times 
A is large compared to the rate of time variation of the individual rates and X, 
quickly tracks its equilibrium value X m (f). This persists until the scattering rate 
A (t) decreases until it becomes comparable with the Hubble rate H(t) = R /R= 
—T/T. At this point, because of the expansion of the universe, the nucleons 
become too dilute to maintain the chemical equilibrium, they decouple and the 
number densities become ‘frozen’ at the values they have at decoupling. Thus, 


1 


rw (1.144) 


Xn (aec) = Xn (aec) = 

As explained in section 5.2, the decoupling (or freeze-out) occurs when the 
temperature is 

Tae ~ 1 MeV. (1.145) 


It is a remarkable coincidence that these two numbers, Taec and Am (given in 
(1.140)), are of the same order. The former derives from the interplay between 
the weak and gravitational interactions, while the latter derives from the difference 
between the u and d quark masses, which is of unknown origin but presumably 
as a result of strong and electromagnetic effects. Because of this coincidence, a 
substantial fraction (of order 20%) of the neutrons survive and this, in turn, results 
in a significant amount of promordial helium formed in the early universe. 

This calculation of the fractional relative abundance of neutrons when the 
weak interactions decouple is only a rough estimate. For a more accurate estimate, 
we must solve the balance equation (1.141). It is convenient to use the variable 
y = Am/T instead of t. In this era, the temperature T is related to the time t by 
equation (1.111) with N, — 3.36, as shown in (1.121). Using (1.140), the total 
decay rate is 

AQ) = Ap XE +e?) (1.146) 


neglecting the neutron decay rate compared to the scattering rates. Bernstein et 
al [6] have approximated As" (y) by 


_ a 
Anp(y) ~ 2A (nv, — pe^) ~ Zyl? + e») (1.147) 
n 
where a ~ 253 and t, ~ 887 s. is the neutron lifetime. Then the solution is 


y t 
Xn(y) = X8) + I dye" IX 6/1. y) (1.148) 
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where 


dr” 


y 
Iy, y) 7 a| - f dy” dy” 


= exp[K(y) - K(y’)]. (1.150) 


Anp(y" Y(1 + 24] (1.149) 


Using (1.147) gives (exercise 4) 


b 
Ko) = Gib 3 + y? + (4+ ye] (1.151) 


baa |0 _3MP — (1.152) 
Na NTa (Amy 


The required integral is easily evaluated numerically and gives 


where 


Xn(y — œ) + 0.15. (1.153) 


This asymptotic value is essentially achieved when y ~ 5 corresponding to 
t ~ 20 s, and a temperature of T ~ 0.25 MeV. 

The next stage of the process is the formation of deuterium. The rate for the 
process np — Dy exceeds the expansion rate of the universe until temperatures 
of order 107? MeV, so that deuterium will be present in this epoch with its 
equilibrium abundance. Using the non-relativistic number densities, analogous 
to (4.19), gives the Saha equation for the deuterium abundance n p 


= — e (1.154) 
MaNp  NaNp \mampT 


where Np = 3and Np = Nn = 2 are the statistical factors, defined in section 2.2, 
for the deuteron and nucleons, and 


Ap 2 mpm, — mp ~ 2.23 MeV (1.155) 


is the binding energy of the deuteron. Then the corresponding mass fractions 


x, = a (1.156) 
nN 
where An = Ap = | and Ap = 2 are, respectively, the mass numbers of the 
nucleons and deuteron, satisfy 
x 24¢(3) ( T V? 
2D n 245 (3) (—) je45/T (1.157) 
X,X p Jn. \mp 
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where 7 is defined in (1.129). A rough estimate of the temperature Tns at which 
nucleosynthesis starts may be made by determining when X p/ Xn X p becomes of 
order one. Taking logarithms of (1.157) gives 


Ap 3. Ap 
—— = — Inn + 6.27 + z In — (1.158) 
Tas 7 2 Ths 


which may be solved iteratively. With the (inferred) value given in (1.129), we 
get 


A 
Tas Ed ~ 0.068 MeV. (1.159) 


The temperature at which nucleosynthesis starts is so much less than the deuteron 
binding energy because n is so small. Since there are of order 1010 photons per 
nucleon, there are enough high-energy photons in the Wien tail of the Planck 
distribution to dissociate the deuterons until the temperature drops to much less 
than the binding energy. A more careful estimate can be made [6] using the 
rate equation for the deuterium abundance, with the onset of nucleosynthesis 
being defined by Xp = 0. This gives Tas ~ 0.086 MeV. Using (1.121), the 
temperature-time relation (1.111) gives 


T -2 
t> aa (s) S (1.160) 
so that nucleosynthesis begins when 
tns ~ 178 s (1.161) 


as immortalized by Weinberg [5]. The neutrons that survived when the weak 
interactions decoupled have been depleted by beta-decay during the intervening 
period. Thus, the relative abundance of neutrons surviving until the onset of 
nucleosynthesis is 


Xn(tns) = Xnly > oo)e n ~ 0.12. (1.162) 


As explained earlier, nearly all of these neutrons wind up in *He, because of its 
large binding energy. If we assume that all of the neutrons are captured in *He, 
the mass fraction of primordial *He, denoted Y, (*He), is simply given by 


Y, (He) ~ 2X, (tns) ~ 0.24 (1.163) 
in excellent agreement with the data [8] 
0.214 < Y, (He) < 0.242. (1.164) 


The foregoing calculation shows how the primordial “He abundance is 
determined by the baryon asymmetry 5. In principle, the same calculation 
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determines 7 once the helium abundance has been measured. However, since 
Yp (He) depends only logarithmically on 7, as is apparent from (1.158), this does 
not yield a very precise value for 5. In contrast, when similar techniques are 
applied to the (much smaller) primordial abundances of the other light nuclei, 
specifically D, ?He, and "Li, the dependence upon n becomes a power. It is a 
testament to the success of the standard cosmological model that a single value 
of 7 simultaneously fits the data on all primordial abundances and allows a much 
more precise determination of the parameter. The value (1.129) of the parameter 
that we use in chapter 4 is obtained from a simultaneous fit of the cosmological 
parameters to all of these and other data, including the recent WMAP microwave 
anisotropy data [1]. 


1.12 Exercises 


|l. Show that the radial path from coordinate (r, 0, $) at t = 0 to coordinate 
(0, 0, $) at time t is a geodesic. 

2. Verify that the Ricci tensor for the Robertson- Walker metric (1.1) has the 
non-zero components given in (1.30). 

3. Verify that the u = 0 component of (1.43) gives (1.45). What information is 
provided by the 4 = i components? 

4. Fora flat FRW universe with pressureless matter and a negative cosmological 
constant A, show that the universe collapses to a point singularity in a time 


T= 


E 


31A 


5. Show that the solution of the balance equation (1.141) for the relative neutron 
abundance has the form (1.148) with the integrating factor as given in 
(1.150). Using the approximation (1.147), verify that the function K (y) is 
given by (1.151). 


1.13 General references 


The books and review articles that we have found most useful in preparing this 
chapter are: 


e Kolb E W and Turner M S 1990 The Early Universe (Reading, MA: 
Addison-Wesley) 

e Weinberg S 1972 Gravitation and Cosmology: Principles and Applications 
of the General Theory of Relativity (New York: Wiley) 

e Weinberg S 1989 Rev. Mod. Phys. 61 | 

e Sarkar S 1996 Rep. Prog. Phys. 59 1493, arXiv:hep-ph/9602260 
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Chapter 2 


Phase transitions in the early universe 


2. Introduction 


Elementary particle theory possesses gauge symmetries that are spontaneously 
broken by scalar fields belonging to non-trivial representations of the gauge 
group when these fields develop non-zero expectation values at the minimum 
of the effective potential. In particular, the SU(2);, x U(1)y gauge group of 
electroweak theory is spontaneously broken to the U(1)em of electromagnetism 
by the electroweak Higgs scalar expectation value. If grand unification to a 
group larger than the SU(3), x SU(2), x U(1)y of the standard model, e.g. 
to SU(5), occurs at some energy scale, then the grand unified gauge group breaks 
spontaneously to the standard model gauge group before this gauge group in turn 
breaks to the U(1)em gauge group. Things may be more complicated than this, 
with a sequence of spontaneous symmetry breakings to subgroups of the original 
grand unified group. 

As we shall see later in this chapter, finite temperature effects may result 
in some other minimum of the effective potential being deeper than the absolute 
minimum of the zero-temperature theory. Then, as the universe cools, it may 
undergo a series of first- or second-order phase transitions between different 
minima of the effective potential. Such transitions will occur at temperatures 
corresponding to the scales of energy associated with the various spontaneous 
symmetry breakings. In the case of the electroweak phase transition to the phase 
with only U(1)em unbroken, the scale of temperature for the phase transition will 
be of order 107-10? GeV and, in the case of a grand unified phase transition to a 
phase with SU (3), x SU (2), x U(1)y unbroken, the phase transition will occur 
at a temperature closer to the Planck scale, perhaps at about 10!6 GeV. If the grand 
unified theory breaks to SU (3). x SU (2); x U(1)y in stages through a sequence 
of phase transitions, the additional phase transitions will occur at intermediate 
scales. 

In later chapters we shall see that these phase transitions can have a profound 
effect on the history of the universe through a number of different processes. For 
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example, topologically stable objects such as domain walls, cosmic strings and 
magnetic monopoles can be formed when the ‘alignment’ of the spontaneous 
symmetry breaking expectation value is different in adjacent causal domains. 
These can make substantial contributions to the energy density of the universe. 
Moreover, if supercooling occurs before the phase transition is completed, the 
reheating that takes place when the phase transition occurs can greatly modify 
pre-existing particle densities. In addition, if the universe spends some time with 
positive vacuum energy (cosmological constant) before relaxing to a minimum 
with zero vacuum energy, then rapid expansion can occur. Such an ‘inflationary’ 
stage in the history of the universe, to be discussed in later chapters, may explain 
the extreme isotropy, homogeneity and flatness of the present day observed 
universe. For all of these reasons it is important to understand any phase 
transitions that may have occurred as the universe cooled. 

In this chapter we shall begin by developing the partition function and 
the effective potential for the gauge field theories at finite temperature [1-5] 
before applying these methods to the Higgs model, as a warm-up, and then to 
electroweak theory and grand unified theory. In each case, the nature of the phase 
transitions that occur as the temperature of the universe drops will be studied. 
We shall then extend the discussion to gauge theories with global supersymmetry 
and local supersymmetry (supergravity). Finally, the nucleation of (stable) ‘true’ 
vacuum from (metastable) ‘false’ vacuum in first-order phase transitions will be 
considered. This nucleation rate will control the extent of any supercooling that 
occurs before the phase transition is complete. 


223 Partition functions 


One of the fundamental objects in the statistical thermodynamics of a finite 
temperature system is the partition function Z defined by 


Z = Tre P8 Q.1) 
where # is the Hamiltonian operator and 
B- OT) =T (2.2) 


in units where the Boltzmann constant kg is set equal to 1. The trace in (2.1) 
means that we are to sum over the (diagonal) matrix elements of e BA for all 
independent states of the system. Once the partition function has been evaluated, 
the (Helmholtz) free energy F is given by 


Z=e fF (2.3) 


where, as usual in thermodynamics, F is related to the internal energy E and the 
entropy S by 
F=E-TS. (2.4) 
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The pressure P and entropy are obtained from the free energy as 


oF 
P=- — (2.5) 
aV |r 
and 
aF 
S=-—|. (2.6) 
aT |y 
As follows immediately from (2.4), the energy density p is given by 
p=F4+Ts (2.7) 
where F and s are the free energy and entropy densities, with 
E= f dx p (2.8) 


and so forth. Thus, in particular, a calculation of the partition function will provide 
us with a determination of the energy density. 

The partition function in a gauge field theory is most efficiently calculated 
using path integral methods. It is not the business of the present book to develop 
such methods which can be found developed at length elsewhere [1-5]. It 
will suffice for our purposes here to to summarize the outcome for the various 
contributions to the partition function. 

The simplest contribution comes from the free (neutral) real scalar fields. 
The Lagrangian density for such a field $ having mass m is given by 


1(ae\? 1 1 
Lig ub) = 5 (2) - jv - me. (2.9) 


In field theory at finite temperature, scalar fields ó(t, x) are replaced by fields 
$ (1, x) periodic in t with period B, 


o(t =0,x) -ó(r = B, x) (2.10) 


where 
T — kf. (2.11) 


We shall use the usual convention of referring to non-zero temperature as "finite 
temperature’. The partition function is formulated in terms of these periodic fields 


as 
- B - 
z- wep f Dee] f dr f ox £e. 3,6] (2.12) 
periodic 0 


= 0 (3$ 


where 
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and Ñ (f) is a temperature-dependent normalization. The integral f D is a path 
integral. Such integrals may be thought of as the generalization of an integration 
fA, dvi A5, dya ... [55 dyn over the finite number of components of an n- 
component column vector y to an integration over the continuous infinity of 
components of a function $(r, x). Evaluation of the path integral gives for the 
contribution of a real scalar field to the free energy 


—BF =1nZ 


d? 
= - fef (Eve? +m? +inf — exp (s P «n. 


(2.14) 


When the mass of the scalar field is negligible compared with the temperature (an 
ideal ultra relativistic gas of scalar particles), the free energy density simplifies to 


z?T* 
90 


when T > m. (2.15) 


For gauge vector bosons, there are some subtleties because, for a typical 
choice of gauge, the Lagrangian involves all four degrees of freedom of the gauge 
field A" (x) and also involves the Fadeev-Popov ghost fields which occur in the 
construction of a consistent renormalizable theory but are not physica! particles. 
However, a massless vector field has only two degrees of freedom and the extra 
degrees of freedom are not physical and cannot be in equilibrium with a heat bath 
nor, of course, can the Fadeev—Popov ghosts. Fortunately there exist gauges in 
which each gauge field has only two degrees of freedom and in which there are no 
Fadeev-Popov ghosts and the partition function can be related to the Lagrangian 
density in such a gauge. In any other gauge, it can be shown that one may continue 
to use this expression for Z but with the form of the Lagrangian appropriate for 
that gauge. In an arbitrary gauge the contribution of the two non-physical degrees 
of freedom of the gauge field cancels the contribution from the Fadeev-Popov 
ghosts. Then, the contribution to the free energy density from a massless vector 
gauge field is found to be 

2x?T* 
F=- 90 ^ (2.16) 

In the case of Dirac fields y, the corresponding development at finite 
temperature involves fields y(t, x) that are anti-periodic in r in the interval 
(0, B), 

v(t =0,x) = —v(t = f, x) (2.17) 


and the contribution to the free energy density is 


In?T* 
180 


when T > m. (2.18) 
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For massless fermions (with only one helicity state of the particle), the calculation 
(using Weyl spinors) gives half of this answer. 

Putting all of this together, the free energy density of an ideal ultra relativistic 
gas (T > m) is given by 


7 z?T^ 
F=-IN -NF | — 2.19 
( B+ 3 r) 90 (2.19) 


where Ng and Nr are, respectively, the numbers of bosonic and fermionic degrees 
of freedom. (Ng = 1 fora real scalar field, Ng = 2 for areal gauge field, Nr = 4 
for a Dirac particle where there are two helicity states for the particle and two for 
the antiparticle and NF = 2 for a Weyl field.) The pressure, entropy density and 
energy density follow from (2.5), (2.6) and (2.4). 


7 z?TÁ 
={N — —— . 
( a+ QN) 90 (2.20) 
7 2z?T? 
= iN, -NF | —— 2.21 
s (Wo +3 r) 45 (2.21) 
and 
7 z?T^ 
={N -NF | ——. 2.22 
p (vos ,) 30 (2.22) 


2.3 The effective potential at finite temperature 


In quantum field theory at zero temperature, the expectation value @, of a scalar 
field $ (also referred to as the classical field) is determined by minimizing 
the effective potential V(¢,). The effective potential contains a tree-level 
potential term, which can be read off from the Hamiltonian density, and quantum 
corrections from various loop orders. The one-loop quantum correction is 
calculated by shifting the fields $ by their expectation values ¢, and isolating the 
terms Zquad($c. $) in the Lagrangian density which are quadratic in the shifted 
fields @. If we write 


V($c) = Vo(&c) + Vi(&c) (2.23) 


where Vo is the tree-level contribution and V; is the one-loop quantum correction 
then, for a single scalar field, 


ee( - i f dfx V e») - J Dó exp ( f d‘x E) (2.24) 


where, as in section 22, f Dó, denotes a path integral. (The derivation and 
evaluation of (2.24) can be found elsewhere [1].) 
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At finite temperature, as discussed in section 2.2, scalar fields $(t, x) are 
replaced by fields ø (r, x) periodic in r with period B, where f is given by (2.2). 
We now write the finite-temperature effective potential V(¢,) as 


V (bc) = Volge) + Vilde) (2.25) 


where Vo and V; are the tree-level and one-loop terms and the expectation value 
$. is now a thermal average. Then (2.24) is modified to 


8 B 
exp (-[ dr [6 Vi e) -f De exo f ar f x Laue d). 
0 periodic 0 

(2.26) 
If gauge fields and fermion fields are included (but with only scalar fields being 
given expectation values to avoid breaking Lorentz invariance), then (2.26) also 
contains path integrals over the gauge fields and their associated Fadeev-Popov 

ghosts, and over antiperiodic fermion fields. 
It is convenient to separate the one-loop terms into the temperature- 
independent part vp (which is identical in form to Vi) and the temperature- 


dependent part vi and write 
YWave+ve (2.27) 
In general, for a theory involving scalar fields ¢;, gauge fields Af and Dirac 


fermions w,, after shifting the scalar fields by their expectation values, the terms 
in the Lagrangian of quadratic order in the fields are of the form 
Lavad (Ge, 9) = — HME i + 3MQ bc)labA? Abu 
- (Mr (e)ls Pr Ws + 33,09" d; 
— 1(0^ A; — 8" AY) (8, Aa, — 9, A?") 
l - - - 

- zg uaa” + pnd" na. (2.28) 
In (2.28), ġe denotes the complete set of expectation values of scalar fields, ĝi 
denotes the shifted scalar fields and ð; is as in (2.13). Also, nq are the Fadeev- 
Popov ghost fields which have to be introduced in the construction of a consistent 
renormalizable theory of gauge fields but do not correspond to physical particles 
and & is the gauge-fixing parameter. For convenience, we have adopted the 
Landau gauge £ — 0 in which couplings of scalar fields to Fadeev-Popov ghosts 
are avoided. ` . 

If the eigenvalues of the mass-squared matrices M, Mi and M? are 

(M2);,(M2)q and (M2), then the temperature-dependent one-loop term in the 
effective potential vi takes the form 


EN T4 oo 
veo z; f er[xw[ — exp (— term) 
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+ >> (3 In [ — exp (-v? + rx.) — In(1 — e») 
—4 > In | T exp (-v¥ + r-x93)| l. (2.29) 


ull o pT. . . "M un 
There are two limits in which V, is particularly simple. First, in the limit 
when all mass-squared eigenvalues are very much greater than 7? all terms in 


vi approach zero exponentially and vi becomes negligible. (It is not obvious 
that this is true of the In(1 — e^?) term in (2.29). However, in a general gauge, 


this term is replaced by In[1— exp(—,/ y? + —T-2(M2)q)1, where & is the gauge 


parameter. If the limit T(Mi) — oo is taken before the limit E — 0, to 
recover the Landau gauge, this term vanishes.) 

Second, in the high-temperature limit where T? is very much greater than 
the mass-squared eigenvalues, we may use 


T^ oo 
3 dy y? In | — exp (-/» + 23] 
0 


T+ RT? RPT R (52) 
£ 
(SQUE) ( R 
ae $C Cen Miri 


as = 16m? In(3—2ye) Inap = 5.4076 (2.30) 


T* (9. 2 [ja -2 
zi], dy y in| 1 + exp (- y* + RT 


OTT R? R (5) 


320 ^ 48 en?” 
£ 
EV ESY PE l R 
EX Gap 7? Eta) Gaza 


where 


ay = x? In (;-m)- 5 Inap = 2.6351. (2.31) 


Thus, in the high-temperature limit, 


Vi j~ ZU (w Ne) 
1 (bc) > - 90 Bg F 
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+5 exo +3 2 MD. +2 Lur] 
- a. Eu +3 Loh] + 


274 
nT 7 
=— (ws + NF) 


90 
T? "2 72 2 
+ ggl" Ms (bc) + 3tr My (¢c) + 2 ME (¢c)] 


- lear? HIM (GP 4... (2.32) 


where M 2 (bc), M2 v ($c) and M2 p (bc)are the scalar, vector and Dirac fermion mass 
matrices of (2.28). "For fermions described by Weyl spinor fields there should be 
no factor of 2 in front of the M2 term in (2.32).) The T^ term in (2.32) is just the 
free energy density for an ideal ultra relativistic gas (in agreement with (2.19) with 
Ng and Nr respectively the number of bosonic and fermionic degrees of freedom, 
in the sense described following (2.19). If some fields are heavy and some are 
light on the scale of the temperature T, then Ng and Ny should be interpreted 
as the degrees of freedom of light fields, and the traces over the mass matrices 
should be evaluated only for light fields, since heavy fields do not contribute, as 
discussed earlier. 


2.4 Phase transitions in the Higgs model 


Before studying phase transitions in electroweak theory and grand unified theory, 
we warm up on the simpler case of the Higgs model. The Higgs model is the 
theory of a complex scalar field coupled to a U (1) gauge field, which may be taken 
to be the electromagnetic field, with the U(1) gauge symmetry spontaneously 
broken. In other words, it is scalar electrodynamics with spontaneously broken 
electromagnetic gauge symmetry. The finite-temperature Lagrangian density is 


- - À ]- - 
L = D pD" ġ* — m^g*ó — 0e» - 7 PF 


1 - - 1l 
- zz CAY + 8,n*8^n (2.33) 
where 
Fuy = dp Ay — 9A, (2.34) 
Dud = (Ou  ieA,)ó (2.35) 
and 


Dud" = (0, —ieA,)0* (2.36) 
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with 8, as in (2.13). The Fadeev—Popov ghosts 7 cancel contributions to the free 
energy from the two non-physical degrees of freedom of the gauge field A^, as 
discussed in section 2.2, and & is the gauge-fixing parameter. For spontaneous 
symmetry-breaking to occur (without requiring radiative corrections to drive it), 
m? must be negative. 

To derive the finite-temperature effective potential using the methods of 
section 2.3 it is necessary to shift the scalar field by its expectation value. We 
write 

($) = $e (2.37) 
V2 
where the factor of 1/ 42 has no significance but has simply been introduced for 
convenience, and ġe may be taken to be real because of gauge invariance. Then 
real fields $ and ¢2 are introduced through 


l . 
ġ= Wie + $1 + i$»). (2.38) 
The quadratic terms in the shifted Lagrangian density are 
l 3A 1 À 
Laud = -3 (m + 70) $1 -3 (v? * +02) $3 


2 
e | = 1 - 
+ 75 €A,A" + 5 Ge» + 5 9.02" 
] - - - 
- g 9.4" 8,0 8^n (2.39) 


where we have adopted the Landau gauge £ — 0 which removes an A“ Aygo 
cross term. Then, in the notation of (2.28), 


M2(g-) = diag (m + ^e. m? + 302) (2.40) 


and 
M?) = eg. (2.41) 


The nature of the phase transition depends on the relative sizes of e^ and A. 


2441 ed 


The tree-level contribution Vol $c) to the effective potential may be read from 
(2.33) by replacing @ by its expectation value Five and is 


2 
— À 
Vole) = 0 en. (2.42) 
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The zero-temperature one-loop correction v? of (2.27) and (2.23) has 
contributions proportional to A? and e* from different loop diagrams. Provided 


that A is small and that, in addition, e* « A, v may be neglected compared to 
the tree term (2.42). In the high-temperature limit, 


T? »44 ep -m (2.43) 
the one-loop temperature-dependent contribution to the effective potential vi 


obtained from (2.32) is given by 


4n? T’ + (A +3e?)T? 


5T 
V1 be) = “9G 24 


CT 
$- ER + (2.44) 


where 


4n C = [MEO + 3 M (917? 


3472 "VL 
- (v? 7+) «(v 7 +7) 43e (2.45) 


3a\3/2 aM 
- (3) +G) n (246) 


when Ad2 >> m?. Thus, the complete effective potential to one-loop order is given 

by 

4nx*T4 
90 

where a temperature-dependent mass m? (T) has been defined by 


Vig) =- 


1 CT À 
+ ;" 0€ — =e + ic* (2.47) 


à + 3e?)T? 

24 (A + 3e*) 
12 

The expectation value $c of the scalar field is obtained by minimizing the 


effective potential. For sufficiently high temperatures, there is only one solution 
of 


m^(T) =m (2.48) 


av 

ab =0 (2.49) 
namely 

oc —0 (2.50) 


and this is a minimum so long as m*(T) is positive. From (2.48), we see that this 
is the case provided the temperature T exceeds To where 


2. —12m? 
07 A-3e 


(See figure 2.1, curve A.) We may write 


(2.51) 
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Vig) 


— h 


Figure 2.1. The finite-temperature effective potential for the Higgs model when e* < A. 
Curves A,B,C are for T > Tj, T = Te and T < To respectively. 


2 
m^(T) = m? ( - 5) . (2.52) 
0 


A maximum and a second (local) minimum of V develop at non-zero values of 
c when 


272 
m2(T) < c (2.53) 
and this happens when the temperature drops below 7| where 
T2 —12Am? 
T? = —— L — = __— seo, 2.54 
1 dC Am? AA+ 3e) - 12C? > “0 (2.94) 
The second minimum is at 
1/2 
CT T? 
= v(T) = v | ——_ + [| l- — (2.55) 
9 NAT ( 5) 
where 
2 
, = 2m (2.56) 
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and the mass m y (T) of the Higgs particle associated with the fluctuations around 
this minimum is given by 


- 
my (T) = ae = CTw(T) - 2m*(T). (2.57) 
De lamur) 


In the same way, we may define a temperature-dependent vector boson mass by 
my(T) =ev(T). (2.58) 


When the second local minimum first arises, the global minimum of V is still at 
$c = 0. However, as the temperature falls, there is a critical temperature T, at 
which the second minimum becomes degenerate with the global minimum. (See 
figure 2.1, curve B.) This occurs when 


8C?T? = Am*(T) (2.59) 
which gives 


2 T2 91? —12Am? 
Te = omarion Lari aaah L aca CO 
1+8C T /9Am? 14+875/Tf AQ + 3e*) — 32C*/3 
so that Tj; > Te > To. At temperatures below the critical temperature, the 
minimum at non-zero @, is the global minimum of V and the system is in a phase 
with spontaneous symmetry breaking, referred to as the asymmetric phase. The 
value of œc at the global minimum changes discontinuously from ġe = 0 to 


(2.61) 


as the temperature passes through T = Te, so that there is a first-order phase 
transition. As the temperature falls below T = To, m?(T) becomes negative, the 
local minimum at ¢, = 0 turns into a local maximum and the only minimum is 
at the non-zero value of ġe = v(T). (See figure 2.1, curve C.) All of this occurs 
because C # 0. For future reference, we note that if C is zero or so small that 
v(T.) & v, then Ti ~ Te ~ To, and there is effectively a (continuous) second- 
order phase transition at temperature T = Te. 


2.4.2 et >i 


When the gauge coupling constant is larger relative to the $* coupling constant, 
there are two differences in the treatment required. First, the zero-temperature 
correction to the effective potential may no longer be negligible and, second, it 
may not be correct to make the high-temperature approximation that T is very 
much larger than the masses of all (shifted) fields. 
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If we now include the zero-temperature radiative correction, the complete 
finite-temperature effective potential becomes 


25 


2 
Vibe) = 7 — $2 + BO? "(s ^6 


«| + Vi (ce) (2.62) 


where M is arenormalization scale which may, if we wish, be eliminated in favour 
of the value of ¢, at the zero-temperature minimum of V and 


| (512 43,4 
= —; | zì . 2.63 
ban? (s + 3e ) (2.63) 


Renormalization has been carried out according to 


d? Y 
do? |a o 


= 3, (2.64) 


dom 2 


dg 


(Details of the derivation of the zero- temperature radiative correction may be 
found elsewhere [1].) If A is small and A € e^, then A^ is negligible compared to 
e* and B simplifies to 
-~ 3e* 
^ 6472" 
With the mass-squared matrices of (2.40) and (2.41) for the (shifted) scalar and 
vector fields, and not making the high-temperature approximation, 


4 oo 
VIGO = 74 Í dy a In | —exp (-/» ET + 2077] 
4 in | — exp (-V¥ + 7-2(¢m? + 77] 
+3in | — exp (^V yl req) - In(1— «]. (2.66) 


We now ask whether we should use the high-temperature approximation to 
study the phase transitions when ef >> A. If we do use the high-temperature 
approximation (and neglect the zero-temperature radiative correction for the 
moment) then, as before, the critical temperature is given by (2.60). If e? < 1 
(as in scalar electrodynamics), then, when e* >> A, we certainly have e? >> A and 
e)? » 13/2, so 


(2.65) 


Cx —. (2.67) 


and 


(2.68) 
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At the zero-temperature asymmetric minimum, still neglecting the radiative 
corrections, 
2 2 —4m? 


ġg=v= n (2.69) 
so that n 
T2 >m + "un =0 (2.70) 
and 3i 
T2 >m + A = —2m?. (2.71) 
However, when e* > A, 
—4m? e4 
2 2 _. 42,2 _ 
T, <my —e'v^ = OX (2.72) 


Therefore, it is not correct to use the high-temperature approximation for the 
vector boson terms when e* >> A. 

Taking account of this observation, we now compute the values of the 
effective potential at the symmetric and asymmetric minima to decide which is 
the absolute minimum when both exist. At the symmetric minimum, 6. is zero 
and the high-temperature approximation is valid provided only that T? > —m?. 
Thus, 
4x?T* 


E ST 
Vibe = 0) = Vj ($c = 0) = -— 


(2.73) 


At the asymmetric minimum, ġe = v, the contribution to Vi (bc) involving 
the gauge field mass ex is exponentially suppressed but the high-temperature 
expansion may still be used for the scalar field terms. Thus, 


2z?T* 4 T? 
90 24 


(—2m?) - so (amy (2.74) 


Vi (o. =v) > — m 


Dropping the zero-temperature radiative correction for the moment, 


= m* êT! mT? T 232 

Video = v) = yo Gs a To Cm d (2.75) 
where we have assumed that the value of the effective potential at the asymmetric 
minimum is the same as at zero temperature apart from the terms proportional to 
T^, T? and T. This can be shown to be correct apart from corrections of higher 
order in e?. Neglecting the m?T? term and the (—2m2)3/2 term compared with 
the T^ term, it can now be seen that the symmetric minimum is at a lower value 
of V than the asymmetric minimum when 


T> (=) jm| = Ta (2.76) 
x 
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Figure 2.2. Development of asymmetric minimum with temperature in Higgs models 
for 4124/11 > ef » AX. Curve A is at zero temperature, curve B is at Tp, the 
critical temperature for the first order phase transition, and C and D correspond to higher 
temperatures. 


This is illustrated in figure 2.2. 


The phase transition now occurs, in principle, at T = 7;; and, because the 
value of the expectation value undergoes a discontinuous change from O to v, the 
phase transition is now first order. As the system cools the phase transition to 
the asymmetric phase may not occur in practice until T is much less than 7,1 
because of the need to tunnel through the potential barrier between the symmetric 
minimum and the asymmetric minimum. 


When the zero-temperature radiative corrections are taken into account the 
situation can change. The effective potential of (2.62) can be recast (exercise 1) 
in terms of the value v of ¢, at the zero-temperature (asymmetric) minimum of 
the effective potential, including radiative corrections, as 


2 
Ve) = B (pe — et fin 3 VIGO (2.77) 


where 


a — 2B"! (25 — *) . (2.78) 
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Figure 2.3. Development of asymmetric minimum with temperature in Higgs models for 
eft x» 4124/11. Curve A is at zero temperature, curve B is at Tz, the critical temperature 
for the first-order phase transition and C and D correspond to higher temperatures. 


The physical Higgs scalar mass m y is given by 


= 2Bv*(4 — a). (2.79) 


The case of symmetry breaking at zero temperature driven by radiative 
corrections, discussed by Coleman and Weinberg [6], corresponds to a = 0, i.e. 
to 


m2, = m2y = 8Bv?. (2.80) 


When m2, « my (a > 0), the situation differs qualitatively from that just 
described because there is a local minimum of the effective potential at ġe = 0 
due to radiative corrections already present at 7 = 0. Then, we must take account 
of the zero-temperature radiative corrections in deriving the development of the 
effective potential with temperature. This is illustrated in figure 2.3. 


The case a > 0, where this is necessary, corresponds to 


e^» —n24. (2.81) 
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2.5 Phase transitions in electroweak theory 


In electroweak theory, the phase transition to be studied is often from a phase 
where the SU (2); x U (1)y gauge group is unbroken to one in which only U(l)em 
is unbroken, i.e. it is from a phase in which the gauge fields mediating the weak 
interactions are massless as well as the photon, so that the weak interactions 
are long range like the electromagnetic interaction, to a phase in which only the 
electromagnetic interaction is long range. The Lagrangian density for electroweak 
theory has a pure gauge field part 


Leauge = — 4 WA, WO” — 1B,, B^" (2.82) 
where the gauge-fixing term and the Fadeev—Popov ghosts have been omitted, 
Wa, = 0,We — au Wi — ge wh we (2.83) 


and 
Buy = 3, By — ð Bp. (2.84) 


In (2.82) and (2.83), Wp (a = 1,2, 3) are the three gauge fields associated with 
the generators of SU (2); and B, is the gauge field associated with U(1)y. The 
electromagnetic field is the superposition 


Ap = cos Ow By + sin Ow W3. (2.85) 


After spontaneous symmetry breaking to U(1)em, Ay, remains massless but the 
othogonal combination 


Z, = —sinOwB, + cosüwW; (2.86) 


together with 
+_wl4:y2 
Wi =W, +iW, (2.87) 
becomes massive. When A, is correctly coupled to the electromagnetic current 
with strength e, the Weinberg angle Ow obeys 


gsinQw = e = g'cosüw (2.88) 


where g and g’ are the gauge coupling constants for SU (2); and U(1)y. 
The SU (2); x U (l)y gauge symmetry is spontaneously broken by the Higgs 
doublet (under SU (2);.) introduced by writing 


H= ( on ) (2.89) 


with weak hypercharge Y = 1/2. The Higgs boson part of the Lagrangian density 
(including the couplings to the gauge fields) is given by 


Litiggs = (DH) (D^ H) - m Ht H — A(H' HY? (2.90) 
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where the covariant derivative of H is 
D,H = (0, ig $t, Wa ig 4B,)H (2.91) 


and Ta, a = 1,2,3, are the Pauli matrices. If the expectation value for the Higgs 
doublet is introduced by writing 


0 
H) = 2.92 
ub ( $2 ) (2.92) 
the zero-temperature tree-level term in the effective potential is 
2 
— m À 
Volge) = 7-42 + 7t- (2.93) 
Including the zero-temperature and one-loop contributions, and assuming that T? 


is large compared with all shifted masses, we have, for the complete effective 
potential, 


= ly 2 lang À 4 7 )5- 
=> -2 Zot- [ Ng - -N. 
V ($c) = 5m (T)9; 3€T9c + 7% B+ SNF) 
2 
af 92. 25 
with 
à e2(1 + 2cos? Ow) h? 

2 2 f 2 
m'(T) = m* + = +——5—— +} >= T 2.95 

V 2 4 sin? 20y 7 12 (2.99) 


where h y are the Yukawa couplings of the fermions, with the largest contribution 
coming from the top quark. Retaining only the gauge boson contributions to the 
$2 term, which is expected to be a reasonable approximation provided that the 
mass of the Higgs particle is not too much larger than the Z boson, we have 


c- 3e3(1 + 2cos? Ow) 


2.96 
4 sin? 20w (2.96) 


Dropping the A? contributions to the radiative corrections, which are always 
perturbatively negligible compared with the tree terms, 


2 
3 [ e 1 + 2cos* Oy l 2 
B=-[— —— — - =) . . 
(E) sin? 28 eu : hý (2.97) 


To study the nature of the phase transition, it is important to determine the sign of 
B. At zero temperature, (2.94) is of the same form as (2.62) with à replaced by 
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4) and can be cast in the form (2.77) with the same replacement in (2.78). A little 
calculation (exercise 2) shows that, when B > 0 anda > O, the zero-temperature 
effective potential always has a minimum at ¢, = 0, in contrast to the situation 
when radiative corrections are neglected where there is a maximum at ¢, = 0 and 
a minimum at $2 = —m?/4. 

However, when B < 0, there can only be a minimum at ġe = O ifo < 0. 
However, œ < Q is ruled out by the requirement that the mass-squared m3, 
of the physical neutral Higgs scalar particle given in (2.79) should be positive. 
Thus, there is never a minimum at ġe = 0 when B < O and the situation is 
then qualitatively the same as in the absence of the zero-temperature radiative 
corrections: the phase transition is first order. 

The sign of B may be determined as follows. We shall retain only the largest 
Yukawa coupling which is the top quark coupling A,. (Retaining more Yukawas 
only strengthens the conclusion.) The corresponding Lagrangian is 


Lrop  —hi QT it? Htr + hic. (2.98) 
where Q; is the SU (2), doublet 


t 
Q= ( b ) (2.99) 


Taking account of the three colours of top quark contributing to the loop, Y ; f G$ 
is replaced by 3A? in (2.97). The top-quark mass term deriving from (2.98) is 
hititrdc/V2 + h.c., so the top quark mass is 


1 
m, = ante. (2.100) 


Also, the W and Z masses are given at tree-level by 
mi, = 1962 m} = 1 sec? Owg"?. (2.101) 


With a measured top quark mass of about 175 GeV, and an empirical value 
of sin? Ow determined by the measured W and Z masses of 0.243, so that 
Qc ~ 263 GeV, we find that 

h; = 0.94, (2.102) 


With e2/4z = 1/137, we find from (2.97) that B is negative. 

We also have to decide whether it is correct to assume that T? is large 
compared with all (shifted) masses. Using the high-temperature expansion and 
neglecting the zero-temperature radiative corrections, we see from (2.95) that, as 
in (2.60), 


sin? 20w 


2 
1 |à e2(1 +2cos? 8w) h? e$ | 4-2cos? óy 
m?|2 4sin? 20y 4 Br? 


(2.103) 
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where, again, we have retained only the top-quark Yukawa coupling. Provided 
e? x» A >» eb, we estimate 
T2 ~ —2m*. (2.104) 


As in the Higgs model, the shifted masses for the Higgs scalars are of order 
—m? and the high-temperature approximation is not too unreasonable. The 
greatest danger for the high-temperature approximation in the Higgs model with 
et x» A arose from the gauge field mass. In the present case, dropping 4/2 
in the numerator of (2.103) and using the value of ¢2 at the zero-temperature 


asymmetric minimum ¢2 = —m?/A, we can estimate that 
T2 à T2 a 
> ~2-5 — x 1.55. (2.105) 
my € m; € 


Thus, if à > e? ~ 0.09, the high-temperature approximation may again be not 
too unreasonable. 

This discussion suggests that the electroweak phase transition is effectively 
second order, because C, defined in (2.45), is small in the sense discussed at the 
end of section 2.4.1. For T > 7., the system is in the symmetric phase in which 

c = 0 and all gauge bosons are massless. For T < T, the system is in the 
asymmetric phase for which ġe # 0, the W* and Z gauge bosons acquire a 
mass and the symmetry is broken from SU (2), x U(1)y to U(1)eg. The critical 
temperature 7, given by (2.103) is of the same order of magnitude as the zero- 
temperature value of ġe at the asymmetric minimum of the effective potential 
provided that A > e?. It was estimated before (2.102) that $c ~ 263 GeV and so 
T. should be of this order of magnitude. 


2.6 Phase transitions in grand unified theories 


Electroweak theory combines the weak and electromagnetic interactions in a 
single model with SU (2); x U(1)y gauge group but achieves no unification 
of these interactions with the strong interaction. It is possible that the weak, 
electromagnetic and strong interactions are unified in a theory involving a larger 
gauge group (a grand unified theory or GUT), perhaps with a single gauge 
coupling constant. Once such a unification has been assumed, the coupling 
constants g, g’ and gs (the QCD coupling constant) are related by group theory 
factors to a single GUT coupling constant gg for the grand unified group. The 
values of the renormalized coupling constants depend on the renormalization 
scale M and, if the coupling constants g(M), g’(M) and g, (M) obey the gauge 
theoretic relationships of the grand unified group at one such scale M = Mg, they 
cannot obey these relationships at lower energy scales. This is because at energy 
scales below Mg the extra gauge fields associated with the enlargement of the 
gauge group to the grand unified group may be ignored. (They acquire masses on 
the scale of Mc.) Then g(M), g'(M) and g, (M) run differently with M when the 
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renormalization group is deployed to derive their dependence on M. However, 
for energy scales greater than Mg, all gauge fields are on the same footing and 
there is a single gauge coupling constant gg developing in accordance with the 
renormalization group equation of the GUT. 

The simplest example of a grand unified group that is large enough to contain 
SU (3). x SU(2), x U(1)y of the standard model is SU (5) and we shall use this 
example to illustrate phase transitions in GUTs. If the renormalization group 
equations are used to run the low-energy values of the gauge coupling constants 
to the scale M = Mg at which the SU (5) relationships 


gs(Ma) = gMc) = J $8 (Mc) = &G(Mc) (2.106) 


hold, then upon inputting the values of the strong and electromagnetic coupling 
constants at M = mz, the unification scale is found to be of order 105 GeV. 
In addition, there is a prediction for sin? Ow (mz) at M = mz of around 0.21, 
which differs significantly from the observed value of around 0.23. Nevertheless, 
we shall use the SU(S) GUT as a simple illustration of the way in which phase 
transitions work in a GUT. In section 2.7, we shall consider supersymmetric GUTs 
in which the prediction for sin^(m z) can be brought into line with experiment to 
a high degree of accuracy. 

In the SU (5) GUT, the grand unified phase transition is from the SU (5) 
symmetric phase to the SU(3); x SU(2), x U(1)y symmetric phase and is 
followed at a lower temperature by the electroweak phase transition described 
in the previous section. We expect the critical temperature for the grand unified 
phase transition to be of order 10? GeV (the energy at which the spontaneous 
symmetry breaking occurs) and, at such high temperatures, the expectation values 
of the electroweak Higgs scalars (of order 200 GeV) are negligible. Thus, to 
describe the grand unified phase transition we need only retain the Higgs scalars 
responsible for breaking the SU (5) gauge group, whose expectation values are on 
the 1015 GeV scale. 

The grand unified Higgs scalars © belong to the 24-dimensional adjoint 
representation of SU (5): 


24 
b= > data (2.107) 
a=) 


where ft, are the SU(5) generators in the fundamental five-dimensional 
representation. Suppressing the gauge-fixing term and the Fadeev—Popov ghost 
term, the finite-temperature Lagrangian density (apart from a possible tr ©? term) 
is 


L= -m tr d? — A (tr?) — Az tr + (Dy)? — 4 tr(Fay F^"). (2.108) 
In (2.108), the covariant derivative D, is given by 


D, = 9,6 + igc(A,, ©] (2.109) 


50 Phase transitions in the early universe 


where 


24 
Ap = >> Aaula (2.110) 
a=} 


with Aa, the gauge fields in the adjoint representation of SU (5), and the transition 
to a finite-temperature theory is made by the replacement of 8,, by à,,, as in (2.13). 
The gauge field strength Fy, is given by 


with the transition to the finite temperature being made in the same way. The 
fermionic terms have been dropped in (2.108) because quark and lepton masses 
are negligible on the grand unified scale. 
For the breaking of SU(5) to SU(3)- x SU(2); x U(l)y, we take the 
expectation value of the field ® to be of the form 
$c 4. ( 3 ;) 

9$) = —-diag[ 1, 1,1, 2, —z }. (2.112) 

(D) Vis g 2'73 
(() must be traceless because the matrices ta are.) This can be shown to be the 
lowest energy state at zero temperature for 


7 
à > 739^ À2 > 0. (2.113) 


The finite-temperature effective potential can then be written, for temperatures 
large compared to all masses, as 


- 1 1 7 7 z?T* 
Vgc) = zmi(T)92 + - i + t- zNrF])—— 
($c) smi (Tbe «&( i+ za) (Na+ sN) 90 
2 
25 
4 
where 
2T ola 272 
mi(T) — mi + so (130% + 472 75g6)T (2.115) 
_ 25 4 
= 2562256 (2.116) 


and the zero-temperature radiative correction has been renormalized at mass M as 
in section 2.4. In (2.114) the A and ii contributions are always small compared 
with the tree terms in (2.114) and have been dropped. 

If g « A1, À2 we may neglect the zero-temperature radiative correction. 
There is then a second-order phase transition with critical temperature T, given 
by 

2 — 60m; 


= ——— r 2.117 
“ 1304 + 47A2 +7583 em 
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For T > Te, the system is in the SU (5) symmetric phase, for which ¢, = 0, and 
all gauge bosons are massless. For T < Te, $c is non-zero, the system is in the 
SU(3)¢ x SU), x U(1)y symmetric phase, and only the electroweak gauge 
bosons are massless. By the same sort of argument as in the previous section, Te 
should be of order 10!? GeV. 

However, if Be >> 1,42, then a discussion similar to that given for the 
Higgs model in section 2.4 shows that a first-order phase transition takes place. 
In the case of a GUT, this conclusion is not negated by Yukawa couplings of 
fermions giving additional contributions to the coefficient B, because quarks and 
leptons do not couple to the grand unified Higgses. This is an important difference 
compared with the electroweak phase transition. 


2.7 Phase transitions in supersymmetric GUTs 


If elementary particle theories possess supersymmetry, then each (complex scalar) 
spin-0 particle is paired with one chirality of a spin-5 particle in the same so- 
called ‘chiral supermultiplet' and each spin-1 vector particle is paired with a 
spin-5 particle of a single chirality in the same so-called ‘vector supermultiplet'. 
The quarks and leptons have supersymmetric partners referred to as ‘squarks’ 
and ‘sleptons’, the Higgs scalars have supersymmetric partners referred to as 
'Higgsinos' and the gauge bosons are paired with ‘gauginos’. In the absence 
of supersymmetry breaking, particles in the same supermultiplet have the 
same mass. Of course, since at the time of writing we have not observed 
supersymmetric partners of the known particles (‘sparticles’), there must be some 
(spontaneous) supersymmetry breaking to produce substantial mass splittings 
within supermultiplets. 

The presence of these extra sparticles can be very important for the 
discussion of phase transitions at temperatures large compared to the sparticle 
masses. In addition, the supersymmetry transformations transforming particles of 
different spin within a supermultiplet into each other strongly constrain the form 
of the Lagrangian and the tree-level effective potential, with further implications 
for phase transitions. These supersymmetry transformations may be local or 
global depending respectively on whether the parameters of the transformation 
do or do not depend on the point in spacetime. In this section the case of globally 
supersymmetric GUTs will be discussed [7-12] and the locally supersymmetric 
(supergravity) case will be discussed in the next section. (For a systematic 
development of globally and locally supersymmetric theories see [14].) 

In general, in a supersymmetric theory, the Lagrangian and the tree-level 
effective potential are determined once the superpotential W is given. For 
example, for a theory with a single scalar field ¢ together with its supersymmetric 
partner, the superpotential for a renormalizable theory takes the form 


W = imọ? + 149? (2.118) 
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the bosonic part of the Lagrangian density is 


aw |? 
Loosonic = 049! 0^ — ls (2.119) 
ag 
and the tree-level effective potential is 
aw |? 
V= Pd = mé + A¢g?|*. (2.120) 


In a supersymmetric SU(5) GUT, the generalization of this renormalizable 
superpotential is 
W = jmtro? + jiu? (2.121) 


with ® defined in (2.107), and the tree-level effective potential is 
V —luimó +a(o? - Jue? D + ga (o, 0]. (2.122) 


The last term in (2.122) is the so-called * D-term' that arises in a supersymmetric 
gauge theory and the first term, which is independent of the gauge group, is 
referred to as the ‘F-term’. At T = 0 (and in the absence of supersymmetry 
breaking), the effective potential (2.122) has degenerate minima (exercise 3) with 
V — 0, namely 


(>) =0 (2.123) 
m . 
(>) = z diag(1, 1, 1,1, 4) (2.124) 
and m 
(b) = n diag(2, 2, 2, —3, —3). (2.125) 


The minima (2.123), (2.124) and (2.125) correspond respectively to SU(5) 
symmetric, SU(4) x U(1) symmetric and SU(3) x SU(2) x U(1) symmetric 
phases. 

At finite temperature, the degeneracy of these supersymmetric minima is 
lifted by the T* terms and by T? tr(@*@) = Y^, $lġal T? terms. As in (2.32), 
the coefficient of the T4 term depends on the value of Ng + &NF for states 
light on the scale of the temperature T. For the SU(5) symmetric phase, the 
24 gauge fields together with their gauginos contribute 90 towards Ng + &NF. 
Each fermion generation has three doublets of left-chiral quarks, one for each of 
the three colours, six right-chiral quarks, a left-chiral lepton doublet, and a right- 
chiral (charged) lepton. These give ng = 3 copies of the 5 + 10 representation 
of SU(5). These three generations of quarks and leptons, together with their 
associated squarks and sleptons, contribute $5 to Np + iN. In total, this gives 


the coefficient of the T^ term in the temperature-dependent effective potential 
7 


? 23 
E (ws gnr) = -50 SU (5) symmetric phase. (2.126) 
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For the SU (3) x SU (2) x U (1) symmetric phase, the 12 gauge fields together 
with their associated gauginos contribute 45 towards Ng + &NF and, for the 
SU (4) x U (1) symmetric phase, the 16 gauge fields together with their gauginos 
contribute 60 towards Ng -- Np. In each of these two cases, the matter field 
content is the same as for the SU (5) symmetric phase and we find that 


x? 7 -19 5 
— | Ng + -NF | = —nr SU (3) x SU (2) x U (1) symmetric phase 
(2.127) 


2 7 —61 
T. (Ne + 1) — x? — SU(4)xU(l) symmetric phase. — (2.128) 


If a copy of 5 + 5 is included to provide the two electroweak Higgs doublets 
(plus Higgsinos) needed to give masses to both up-like and down-like quarks in 
a supersymmetric theory, then there is an additional contribution —57?/12 to 
(2.126) and —n?/6 to (2.127), in the latter case from the two SU (2); doublets 
that are all that survive from the 5 4- 5 after spontaneous symmetry breaking of 
SU (5) by the expectation values of the adjoint Higgs scalars. (The surviving 
adjoint Higgs scalar states are too heavy to contribute to the temperature- 
dependent corrections to the effective potential for temperatures below the grand 
unification scale.) In the SU (4) x U(1) phase, the complete 545 becomes massive 
and fails to contribute to the temperature-dependent corrections. The values of 
&5 (Np + 1Nr) in (2.126),(2.127) and (2.128) are then modified to 272, $b? 
and sin, respectively. Thus, the T^ term favours the SU(5) symmetric phase 
over the SU(3) x SU(2) x U(1) and SU(4) x U(1) symmetric phases, which 
remain on the same footing. This conclusion is only strengthened by the inclusion 
of the adjoint Higgs supermutiplet which provides extra light states in the SU (5) 
symmetric phase. If the theory contains more than one pair of Higgs multiplets 
coming from 5 + 5, then the SU (5) symmetric phase continues to be favoured 
over the other two phases but the SU(3) x SU(2) x U(1) symmetric phase is 
favoured over the SU (4) x U(1) symmetric phase, which is the assumption we 
shall make in what follows. 

Clearly, the SU (5) symmetric phase, for which the scalar expectation value 
is zero, minimizes the T?42 term as well as the T^ term. Thus, the theory 
appears to favour the SU(5) symmetric phase at all temperatures. However, 
at temperatures below 100 GeV-1 TeV the (non-perturbative) supersymmetry 
breaking mechanism will lift the degeneracy of the three phases more than 
the temperature-dependent terms and may favour the SU(3) x SU(2) x U(1) 
symmetric phase. At higher temperatures the temperature-dependent terms 
dominate. This suggests that the universe is in an SU (5) symmetric phase down 
to temperatures of 100 GeV-i TeV. 

This conclusion is modified by the running of the gauge coupling constant 
gs for SU(5) with temperature. This may result in gs becoming strong at 
temperatures of order 109-100 GeV. Then, confinement may result in fewer 
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massless states. (The SU(5) coupling becomes strong at a higher temperature 
than the SU(4) and SU(3) couplings.) Then one of the other two phases may 
become the absolute minimum and, eventually, tunnelling may occur to one of 
the other phases. In general, running coupling constants g,(j) and ga(M) at 
energy scales 4 and M are related by 


M? 
16229; ^ (u) = 1627g77(M) + b, ln (5) (2.129) 


where the renormalization group coefficient b; is given by 


a = -Me(G3) + $9 e) + 33 c2(Sa). (2.130) 
Ra Sa 


In (2.130), the group theory factor c1(G4) for the group Ga is related to the 
structure constants fagy by 


c1S0p = fays fBys Q.131) 


the group theory factor c2( R4) for the representation Ra of the group is given in 
terms of the matrices T, representing the generators of Ga in the representation 
Ra by 

c2dap = tr(To Tg) (2.132) 
and the summations are over chiral fermion representations Ra of Ga and scalars 


in representations Sa of Gg. For a supersymmetric theory, each gauge field is 
accompanied by a gaugino in the adjoint representation, so that 


-Ma(6,) 9 -Ma(G) + $e1(Ga) = —3c1 (Ga). (2.133) 
Also, each chiral fermion is accompanied by a complex scalar so that 
$e) > $¢2(Ra) + le) = c2(Ra). (2.134) 
Thus, in a supersymmetric theory 
ba = —3c1(Ga) + 9 c2(Ra) (2.135) 
Ra 


where the sum is over all chiral supermultiplets. Identifying the energy scale y 
in (2.129) with T, and recasting the renormalization group equations in terms of 
Qa = 82/47 , gives 


-l(T) = az (M) + 24 (7 
az (T) = a7 (4) + zn (7. J. (2.136) 


For SU (5), 
c(SUGS)) =5  ea52o525 o1u9-3. (2.137) 
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With ng = 3 generations in 5 + 10 and Ny sets of Higgs scalars in 5, we have 


bs = —9 + i Ny (2.138) 
and, for Ny = 2, 
bs = —8. (2.139) 
It is convenient to choose 
M = Mx 22x 10/6 GeV (2.140) 


which is the energy scale at which the low-energy (supersymmetric) SU (3) x 
SU (2) x U(1) coupling constants reach a common value so that grand unification 
may occur. At this scale, 

as(Mx) = xy. (2.141) 


If we take the criterion for the SU (5) coupling constant to become strong to be 
as(T) ~ 1, then the corresponding temperature is 


T c 6.5 x 10M ~ 108 GeV. (2.142) 


It is at this temperature that we expect that either the SU (3) x SU(2) x U(1) or 
the SU (4) x U(1) symmetric phase becomes the absolute minimum. Eventually 
tunnelling will occur to whichever of these phases is the absolute minimum. If it is 
the SU (3) x SU Q) x U (1) symmetric phase, then the universe will continue in this 
phase until the coupling constant g4 becomes strong at some lower temperature. 

In these globally supersymmetric theories, because the zero-temperature 
effective potential is zero when supersymmetry is unbroken, the cosmological 
constant is zero in each of the SU (5), SU (4) x U(1) and SU(3) x SU(2) x U(1) 
phases until supersymmetry breaking becomes non-negligible (with respect to T) 
for temperatures below 107-103 GeV. 


2.8 Phase transitions in supergravity theories 


Up to now we have been discussing theories with global supersymmetry. A 
theory with local supersymmetry is necessarily a theory which contains gravity 
(supergravity). The reason is that the supersymmetry algebra contains the 
generator P,, of translations and when we allow supersymmetry transformations 
that depend on the point in spacetime (local supersymmetry), we have to consider, 
among other things, translations that vary from point to point in spacetime. Thus, 
local supersymmetry contains general coordinate transformations of spacetime 
and so is a theory of gravity. 

In phenomenologically acceptable theories, the supersymmetry breaking 
scale M; is large (typically 10'9-10!! GeV) where M? is the expectation value 
of the auxiliary field of the scalar responsible for supersymmetry breaking. For 
example, in theories with F-term supersymmetry breaking, at tree level, a fermion 
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has a supersymmetric partner of lower mass than itself as well as one of higher 
mass than itself. Since this does not occur in the real world, it is necessary 
for there to be significant quantum corrections to avoid this problem, though 
not so big that the hierarchy problem is no longer solved. This is the origin 
of the large supersymmetry breaking scale. In these circumstances, the effects 
of quantum gravity can no longer be neglected. In particular, in the presence of 
supersymmetry breaking, scalar particles acquire masses of order M2/m p (where 
mp = 1.2 x 10!9 GeV is the Planck mass) which are of order 10? GeV. 

Once gravitational effects are important, we should allow not only that the 
superpotential may contain non-renormalizable terms but also that there may be 
non-renormalizable kinetic terms. Thus, for example, the scalar field kinetic terms 
take the form 

Ky 93" 95 (2.143) 

Gi 99; uc ` 
where K (d, $7) is referred to as the Kahler potential. It turns out [13] that the 
complete supergravity Lagrangian can be expressed in terms of 


G =K 4 InjW|? (2.144) 


apart from couplings to gauge fields, which involve the gauge kinetic function. It 
will often be convenient to work in units where the reduced Planck mass Mp — 1, 
where M p is defined by 

Mp z81GN (2.145) 


where Gy is Newton's constant, so that 

Mp ~ 244 x 1018 GeV. (2.146) 
In these units, the zero-temperature effective potential takes the form 

V = e8(Gi(G^ y,G) — 3) Q.147) 


where the scalar fields have been written as @;, their adjoints as $!* and derivatives 
of G as 


; 0G 0G 
i = — —_—_— 
Ge; $ Gi=5 " (2.148) 
and 2 
j 9^G 
G; = ——.. . 
j 89,097 (2.149) 
The inverse (G~')/, obeys 
(G^ ),G; = ô. (2.150) 


In particular, in the case of a single gauge-singlet chiral superfield «b with minimal 
kinetic terms arising from 


G = ¢*¢+1n(W/" (2.151) 
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we have ; 
* aw 
y =e"? (|/__ +w 
e (5s té 


- awe) (2.152) 


instead of V = |3W/34|? for the globally supersymmetric case, as in (2.120). 
Consideration of the supersymmetry transformation laws of scalar fields and 
their fermionic superpartners shows that the criterion for supersymmetry breaking 
is that AW/ad + @*W should be non-zero [14]. Thus, whereas the globally 
supersymmetric theory vacua with unbroken supersymmetry had V = 0, in the 
locally supersymmetric case (with minimal kinetic terms) supersymmetric vacua 
have 

y = —3e%"t wp? (2.153) 


More generally, we may consider gauge non-singlet chiral superfields 4;. 
In that case, the supergravity Lagrangian [13] also involves the gauge kinetic 
function fap. For the minimal choice of gauge kinetic function 


fab = Sab (2.154) 


the gauge kinetic term 
—4Re fab Fayv FE” (2.155) 


simplifies to —4 Fay Fe ” Then, with minimal kinetic terms, the zero- 
temperature effective potential takes the form 


V= e? ( 


89; 


2 
1 . 
- awe) + 38/6 (Ta)ij jG" Taug (2.156) 


where we have assumed a simple gauge group with gauge coupling constant g, 
and 


G =" + c. (2.157) 


If supergravity is unbroken, study of the supersymmetry transformation laws 
shows that we must have 


Ww 
G'(Ta)ij bj =0 and om +9o'*w=0 (2.158) 
i 
and supersymmetric vacua have 
V = —3e%* w]. (2.159) 


There is no longer any requirement that supersymmetric minima should be 
degenerate in energy at T = 0 nor that they should have lower energy than all 
other vacua. 

In the high-temperature limit, (2.32) still applies to the one-loop 
temperature-dependent correction to the effective potential provided that, in 
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the case that the kinetic terms are non-minimal, we first construct fields with 
canonical kinetic terms by field redefinition. This means that, for scalar fields ¢; 
(and their fermionic superpartners), we have to write 


$i = (G7) (jw (2.160) 


and for the gauginos 
da = (Re fab) (ew (2.161) 


where (¢;)w and (Ap)w are the normalized fields. The relevant mass matrices 
are obtained from the standard supergravity Lagrangian. The outcome [15-18] is 
particularly simple in the case of minimal kinetic terms. Then 


274 
T z*T 7 
- Ng + -N 
1 90 ( Bg r) 
T? c [3 » 
tie | (A+B) + (C+ NVC -2 450° +C-1} (2.162) 
where 
A= G'GijG! + GiGG; (2.163) 
B=GjG! (2.164) 
C=G;,G' (2.165) 


and N is the number of chiral superfields. In practice, N is often large. For 
example, if the matter field content is that of the minimal SU (5) GUT, there are 
ng = 3 generations in the 5 + 10 representation contributing 45 to N, two copies 
of 5 or 5 for electroweak Higgs contributing 10 to N and one copy of 24 for 
the grand unified Higgs scalars contributing 24 to N, leading to a total value of 
N — 79. 

Provided that all of the couplings in the superpotential are of the same order 
of magnitude, in units where the reduced Planck mass is 1, we can then take the 
large-N limit to obtain 


90 


1 => 


-T m T’ 
8 


7 T? , 
Ng + Ne) + N 56^ (GG -2). (2.166) 


Provided that the changes in vi generated by non-minimal kinetic terms do not 
introduce extra factors of N (which is true for most choices of G), this is still the 
large-N limit of v in that case. For any particular choice of Kähler potential and 
superpotential and, therefore, of G, the discussion of phase transitions proceeds as 
before with the modified finite-temperature corrections to the effective potential 
of (2.162). 


Nucleation of true vacuum 59 
2.9 Nucleation of true vacuum 


In sections 2.4 and 2.6, we have found that first-order phase transitions may occur 
as the universe cools. If the phase transition is first order, it will be necessary for 
the universe to tunnel out of the metastable minimum [19-21] (false vacuum) to 
reach the absolute minimum (true vacuum). If the tunnelling rate is small, this 
may occur at temperatures very much lower than the temperature To of (2.76) 
at which the energy of the zero- (and low-) temperature vacuum drops below 
that of the high-temperature vacuum. In what follows, we shall approximate the 
tunnelling rate by its T = 0 value and study, for simplicity, the case of a single 
scalar field $. 

In the semi-classical limit (small A) the probability per unit time per unit 
volume for formation of a bubble of true vacuum F is given by [19] 


T = Ae B/h (2.167) 


where 
B — SE (2.168) 


with Sg the Euclidean action for a solution of the Euclidean Euler-Lagrange 
equations which satisfies the boundary conditions that $ approaches the false 
vacuum (metastable minimum) as the Euclidean time tg — too, and with zero 
Euclidean time derivative at tg = 0, where 


te = it. (2.169) 


This is referred to as the ‘bounce’ solution (because it turns around and bounces 
back to the false vacuum.) The tunnelling is dominated by the solution for $ that 
gives the smallest value of Sz. The derivation of this result is by studying the 
imaginary part of the effective potential in the false vacuum. The coefficient A 
is, in general, more difficult to calculate [20]. However, since it does not appear 
in an exponent, an estimate on dimensional grounds is sufficient. At T = 0, we 
may expect A to be of order M^ where M is an appropriate mass scale, such as 
the height of the potential barrier to be tunnelled through or (see [21]) the value 
of (d? V/d$?)!/ ? at the metastable minimum, which will usually be of the same 
order of magnitude. 
For a single real scalar field, the Euclidean action takes the form 


Sp = f d'x (43 p3" o + V (9) (2.170) 


with the metric the positive-definite metric of four-dimensional Euclidean space, 
and the Euclidean Euler-Lagrange equation is 
929 


d dp = —- + V9 = V'(9). (2.171) 
at; 
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It can be shown that the bounce which minimizes Sg is O(4) symmetric, i.e. ¢ is 
a function of the four-dimensional radial variable p alone, where 


p =tet+x’, (2.172) 


co 2 
Se = an? f dp p? E ($) + vo (2.173) 
0 p 


and the equation of motion is (exercise 4) 


Then 


EP 4 lt9 2 y'g). (2.174) 


In terms of this variable, the boundary conditions for a bounce solution are 
$ — $4 as p — oo, and dédp = 0 when p = 0, where $, is the value of 
$ at the metastable minimum. 

An explicit bounce solution is most easily obtained in the so-called ‘thin- 
wall’ approximation [19] which treats the energy difference e between the two 
vacua as small compared with the height of the potential barrier between them. 
Then, we write 


V(9) = Vol) + O(e) (2.175) 


where Vo(#) is the effective potential in the limit that we neglect the energy 
difference between the two vacua, and 


€ = V ($+) — V($-) (2.176) 


where $4 and @~ are, respectively, the values of $ at the metastable and absolute 
minima. The Euclidean equation of motion is approximated first by replacing 
V'($) by Vo) in (2.174). We shall see later that, in this approximation, it is also 


correct to neglect 1$. in which case the equation of motion to be solved for the 
bounce solution becomes 


d?o 


ap? = Vo(@). (2.177) 
It is not difficult to show that the solution is 
p= de (2.178) 
V2V0(¢) ` 


A simple example is obtained by taking 


2 
_Àfp_ m 
Void) = 5 (+ - =) (2.179) 
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which has degenerate minima at $ = +//A with Vo = 0. Then (2.178) leads 
(exercise 5) to 
H H 
$ — bo = -= tanh [$ (o - 0] (2.180) 
Va E 
where œo is the value of @ at some reference value p of p. Choosing po to be 
the value at which ¢ takes the average of its values in the true and false vacua, 


namely œo = 0, then 


— Ho 
$ = Tunt po). (2.181) 


Assuming that pp >> 4^, the length scale on which @ varies, then 


u 
ġ > -—- asp —>0 (2.182) 
Vi 
and, in any case, 
m 
> — as p > oo. 2.183 
$ JA p ( ) 


It will be seen later that this is correct for the bounce solution that minimizes 
Se. If (after lifting the degeneracy of the two vacua using the O(e) term in V) 
$ = $- = —u/A/À is the true vacuum and $ = $+ = nu] NX is the false vacuum, 
then the bounce solution describes a bubble of true vacuum embedded in the false 
vacuum with wall thickness of order po, where the rapid variation of $ occurs, 
separating the two regions. Under the assumption that po >> 4^, the radius of 
the bubble is large compared with the thickness of the wall, which explains the 
*thin-wall' approximation terminology. 

The next steps are to calculate B and to justify the various assumptions made. 
In the thin-wall approximation, 


¢(p) = — A for p « po (2.184) 
= Aem e| for p — po (2.185) 
= NA for p >> po. (2.186) 


The contribution to Se from outside the wall (p >> po) is zero because here 
1(d$/dpy + V(¢) ~ 0. The contribution from inside the wall (p < po) is 
obtained by first noting that here 


1 (de? 
zbT V($) x -e. 2.187 
5 ( =~) tV) ( ) 
As a consequence, the contribution to Sg from inside the wall is 
2 
um 
Sp c: — ——epq. (2.188) 


2 
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The remaining contribution to Sg comes from the region within the wall. Noting 


that (2.178) implies that 
d 
= f2Vo (2.189) 
p 


this contribution to Sg may be written as 


~ 2x79, if dp 2Vo(¢) = 2x2 [ d$ /2Vo(9). (2.190) 
Thus, the total value of Sg in the thin-wall approximation is 


n? 
Sg = -Teo + 20? p21 (2.191) 


where 


$+ 
l= Í dé /2Vo0(¢). (2.192) 
$- 


Minimizing Sg with respect to pọ, to find the bounce solution that dominates the 


tunnelling, gives 


3 
po = — (2.193) 


Thus, when € is small, po is large compared with ~!, which justifies an earlier 


assumption. It also follows that the small energy difference between the two 
phases does indeed correspond to a bubble of true vacuum with a thin wall. 
Moreover, the neglect of ise is also justified because outside or inside the 


bubble iE: is negligible because $ is slowly varying, and within the bubble wall 


i IS is negligible because po is large. With po given by (2.193), the 


minimum value of Sg deriving from (2.191) is 

272? 1^ 
2e? 

which provides the value of B for the bounce solution that dominates the 


tunnelling rate (2.167). With $4 = u/i, and Vo given by (2.179), it is 
straightforward to evaluate J to obtain 


E= (2.194) 


2 3 
I= M. (2.195) 
Then the tunnelling rate is given by (2.167) with 
872 12 
B= oe (2.196) 


Once the bubble of the true vacuum has materialized, it can be shown [22] that 
(in the thin-wall approximation) it materialises with radius p = pọ and that the 
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development in time of the bubble can be obtained by continuing in time from 
Euclidean time to real time. Thus, whereas in Euclidean time the surface of the 


bubble was at 
p= yÈ +x? = p (2.197) 


in real time the surface of the bubble is at 
x? — c? = po (2.198) 


(restoring the explicit speed of light c which we have been setting to 1). The 
quantity oo is, in general, on a sub-microscopic scale and so negligible. Thus, to 
a good approximation, the surface of the bubble is at x? = c?r?. Consequently, 
the radius of the bubble grows with the speed of light. 


2.10 Exercises 


1. Recast the effective potential of (2.62) in the form (2.77). 

2. Show that the zero-temperature effective potential deriving from (2.94) 
always has a minimum at $; = 0 when B > O anda > 0. Also, show 
that when B < 0 there can only be a minimum at the origin whena < 0. 

3. Check that the SU(S) symmetric, SU(4) x U(1) symmetric and SU(3) x 
SU (2) x U(1) symmetric minima of (2.123), (2.124) and (2.125) all have 
V =0. 

4. Derive the Euclidean action (2.173) and the equation of motion (2.174) when 
¢ is a function of the four-dimensional radial variable p alone. 

5. Check that the bounce solution satisfies (2.178) and that, for Vo given by 
(2.179), this leads to the explicit solution (2.180). 


2.1 General references 


The books and review articles that we have found most useful in preparing this 
chapter are: 


e Kolb E W and Turner M S 1990 The Early Universe (Reading, MA: 
Addison-Wesley) 

Olive K A 1990 Phys. Rep. 190 307 

Linde A D 1979 Rep. Prog. Phys. 42 389 

Bailin D and Love A 1993 Introduction to Gauge Field Theory (Bristol: IOP) 
Bailin D and Love A 1994 Supersymmetric Gauge Field Theory and String 
Theory (Bristol: IOP) 


Bibliography 


[1] Bailin D and Love A 1993 Introduction to Gauge Field Theory (Bristol: IOP) ch 17 


64 


[2] 
[3] 
[4] 
[5] 
[6] 
[7] 
[8] 
[9] 
[10] 
[11] 
[12] 
[13] 
[14] 


[15] 
[16] 
[17] 
[18] 
[19] 
[20] 
[21] 
[22] 


Phase transitions in the early universe 


Kirzhnits D A and Linde A D 1972 Phys. Lett. B 42 477 

Weinberg S 1974 Phys. Rev. D 9 3320 

Dolan L and Jackiw R 1974 Phys. Rev. D 9 3357 

Bernard C W 1974 Phys. Rev. D 9 3312 

Coleman S and Weinberg E 1973 Phys. Rev. D 7 1888 

Nanopoulos D V and Tamvakis K 1982 Phys. Lett. B 110 449 

Srednicki M 1982 Nucl. Phys. B 202 327 

Ellis J, Llewellwyn Smith C H and Ross G G 1982 Phys. Lett. B 114 227 

Nanopoulos D V, Olive K A and Tamvakis K 1982 115 15 

Srednicki M 1982 Nucl. Phys. B 206 132 

Bailin D and Love A 1984 Nucl. Phys. B 239 277 

Cremmer E, Ferrara S, Girardello L and van Proeyen A 1983 Nucl. Phys. B 212 413 

Bailin D and Love A 1994 Supersymmetric Gauge Field Theory and String Theory 
(Bristol: IOP) 

Gelmini G B, Nanopoulos D V and Olive K A 1983 Phys. Lett. B 131 52 

Olive K A and Srednicki M A 1984 Phys. Lett. B 148 437 

Enqvist K et al 1985 Phys. Lett. B 152 181 

Binetruy P and Gaillard M K 1985 Phys. Rev. D 32 931 

Coleman S 1977 Phys. Rev. D 15 2929 

Callan C and Coleman S 1977 Phys. Rev. D 16 1762 

Linde A D 1983 Nucl. Phys. B 216 421 

Coleman S and de Luccia F 1980 Phys. Rev. D 21 3305 


Chapter 3 


Topological defects 


3.1 Introduction 


When a phase transition occurs in the early universe, the alignment of the 
spontaneous symmetry breaking expectation value may be different in adjacent 
causal domains. In that case, topologically stable objects such as domain walls, 
cosmic strings and magnetic monopoles, referred to as ‘topological defects’, 
can be formed [1]. Similar topological defects are familiar in condensed state 
physics. For example, a ferromagnet is, in general, divided into domains 
where the spontaneous magnetization is aligned in a definite direction. At the 
boundary between two such domains, the direction of the magnetization evolves 
continuously between its direction in one domain and its direction in the adjacent 
domain to form a so-called ‘domain wall’. Another example of a topological 
defect is provided by a magnetic flux line in a type [I superconductor. In this case, 
the phase of a complex scalar field (associated with the Cooper-paired electron 
condensate) changes by 27n, where n is an integer, in going around a closed 
loop surrounding the flux line and the flux line carries n units of a quantum 
of magnetic flux. In addition to these two-dimensional and one-dimensional 
topological defects, there are also point defects. These are most familiar in 
a particle physics context as magnetic monopole solutions though analogous 
objects occur in superfluid ?He. In the context of particle physics and early 
cosmology, the corresponding topological defects are associated with the vacuum 
expectation values (VEVs) in electroweak or grand unified theories or with the 
various moduli that occur in superstring theories. 

Once such topological defects have been formed at a phase transition in 
the early universe, they can manifest themselves in various ways. The simplest 
manifestations are as a potentially substantial contribution to the energy density 
of the universe or, in the case of magnetic monopoles, as a relic density of 
particles carrying magnetic charge. More subtle manifestations are also possible. 
For example, relic cosmic strings may act as gravitational lenses or produce 
temperature fluctuations in the cosmic microwave background radiation. 
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Figure 3.1. Effective potential for $^ theory. 


In the following sections, we shall develop the theory of domain walls, 
cosmic strings and magnetic monopoles in turn. 


3.2 Domain walls 


When the effective potential in a field theory has two degenerate minima with 
V = 0, two-dimensional solutions of the field equations with finite energy per 
unit area can occur. The simplest examples of such domain-wall solutions [2) are 
obtained from the Lagrangian density 


L = 19,99" $ — V9) (3.1) 
with 
À 
vio) = 1G - 0 (3.2) 


where ¢ is a real scalar field and A and 7 are real constants. The potential V has 
minima with V = 0 at @ = +n and a maximum at @ = 0 (see figure 3.1). The 
idea is to construct a static solution for which $ evolves from one minimum for 
z — —ooto the other minimum for z — -too. In that case, the domain wall is in 
the x-y plane. Clearly, we can construct such solutions with the domain wall in 
any chosen plane. 

In general, for a static solution where $ depends only on z, the field equation 
is 


~z =V. (3.3) 
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As for the (approximate) bounce in (2.177), the solution satisfies 
| (de ? =V + 3.4 
2\dz/ — e G4) 


where c is a constant. The energy (per unit area) of the domain wall is given by 


[oli (aov? 
E= Í. E (2) + ve) dz (3.5) 


and so to obtain a solution with finite energy density, it is necessary to require that 
d$/dz > 0 as z > +œ. (V(¢) will already approach zero as z — too if we 
succeed in constructing a solution of the type we are looking for.) Thus, we must 
take c = 0. Then integrating (3.4) gives 


dó 
JB 9 
(analogously to (2.177)). For the choice (3.2), this gives 
J/2 ($) 
z — 2 = r——arctanh | — 3.7) 
nv n 


where zo is an integration constant. Different choices of this constant amount to 
moving the centre of the domain wall along the z-axis. Inverting (3.7) gives 


nV À 

$ = $y = Fn tanh | J (z J . (3.8) 
Asz — 00, $4 — Fn and, asz > —oo, $4. — +n. The two solutions $ = $4 
and @ = 6. are referred to, respectively, as the ‘kink’ and the 'antikink'. The 
kink evolves from the minimum at $ = —n forz — —oo to the minimum at 
$ = +n for z — +00 (see figure 3.2) and the antikink evolves conversely (see 
figure 3.3). Both domain walls have their centre at z = zo in the sense that $ = 0 
when z = zo. The ‘thickness’ of each domain wall is of order /(2/A)n7~!. This 
is a balance between the desire of the potential energy to make the wall as thin 
as possible and the desire of the gradient energy to make the wall as thick as 
possible. The total energy per unit cross-sectional area of a kink or antikink is 
finite because in (3.5) V(@) and d$/dz go to zero sufficiently fast as z — -too. 
Substituting the explicit solutions (3.8) into (3.5) gives the finite energy per unit 
area of a kink or antikink (exercise 1): 


E = VOAR. (3.9) 


The stability of a domain wall is associated with a topological principle. The 
Lagrangian (3.1) possesses a discrete Z2 symmetry 


23:0 > —ó (3.10) 
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Figure 3.2. Kink soliton solution. 


Figure 3.3. Antikink soliton solution. 


which is broken by the vacuum expectation value of $. The domain walls connect 
the two distinct minima which constitute the complete space of minima. There is 
no way to continuously deform a domain wall to a new object that does not have 
its ends at distinct minima, while keeping the energy density finite. 


The formation of such topological defects can be understood in terms of 
the Kibble mechanism [1], which we now describe. As discussed in chapter 2, 
symmetries are expected to be restored at high temperatures. As the universe 
cools, it passes through a phase transition and different regions of the universe 
undergo phase transitions to different minima of the zero-temperature effective 
potential. There will be some correlation length £ such that the VEVs for points 
of space separated by more than about £ are uncorrelated. The length £ cannot 
be larger than the particle horizon since no influence can have propagated over a 
distance greater than this from the big bang to the time of the phase transition. 
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In the present case, at high temperatures we expect ($) to be zero and, 
after the phase transition, some regions will have (6) = 7 and some will have 
(6) = —n. In this way topological defects can form. Although a domain wall can 
change its shape, topology prevents it from disappearing once formed because its 
ends are at discrete minima of V. In general, there will be curved domain walls as 
well as flat domain walls, with the curved walls enclosing a region of space inside 
which the VEV of ¢ differs from the VEV of $ outside. 

Many domain walls will form constituting a random network whose 
evolution with time may be studied. The result for non-relativistic domain walls 
is that the energy density (i.e. the energy per unit volume) of the domain walls 
scales as R^!, where R(t) is the scale factor of the universe. This should be 
compared with the energy density due to radiation which scales as R~* and 
that due to matter which scales as R^? (as in section 1.3). Consequently, as R 
increases with time, the energy density of the universe comes to be dominated 
by domain walls. The total energy associated with a plane domain wall with area 
Hy 2, where Ho is the present day Hubble constant, is far larger than the estimated 
total energy due to matter within the Hubble radius. For example, for A not too 
much different from 1 and n ~ 100 GeV, the former is larger by 12 orders of 
magnitude. A larger expectation value for the scalar field makes things worse 
(exercise 2). Thus, domain walls appear undesirable. This suggests that a theory 
is needed which does not have disconnected vacuum states, such as ($) = +n in 
the present model, to avoid the existence of domain walls. Alternatively, a period 
of inflation (see chapter 7) is needed to dilute the domain wall density. 


3.3 Global cosmic strings 


One-dimensional topological defects (cosmic strings) can also be produced by 
phase transitions in the early universe. The simplest example of a cosmic string 
[3] may be derived from the Lagrangian density for a complex scalar field $: 


L= 8,9*9"$ — Vid) (3.11) 
with à 
Vio) = z (9*6 - my (3.12) 


and A and 7 are real constants. This Lagrangian possesses a global U(1) 
symmetry under . 
$ — e^o (3.13) 


where œ is an arbitrary constant real number. The potential V of (3.12) has a 
maximum at $ = 0 and minima with V = 0 when 


$ = ne? (3.14) 


where f is an arbitrary real number. The vacuum VEV (3.14) breaks the global 
U (1) symmetry because it is not invariant under the transformation (3.13). 
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It is possible to construct other extended static solutions as follows. Take 
cylindrical polar coordinates (p, 0, z). The solutions in question have the form 


o = ne"? f (p) (3.15) 


for some integer n and the function f(p) is to be determined from the field 
equations. In cylindrical polar coordinates, these are (exercise 3) 


d? 1d 2 
Eg pc -n (3.16) 
where 
E = Al gp, (3.17) 


The phase of $ will become undefined at p = 0 unless |@| + O0 as p — 0. Thus, 
the boundary condition 


fi> asp > 0 (3.18) 
is required to ensure a single-valued field $. Also, there is the boundary condition 
f@)-> 1 as p — oo (3.19) 


so that $ approaches one of its continuum of minima (3.14) in order to minimize 
the energy. Equation (3.16) may be solved numerically with these boundary 
conditions. The scale of the distance is set by A!/2n so the vortex line or cosmic 
string has a core of radius of order A^ !/25-! outside of which $ approaches its 
minima as p — oo and inside of which $ — 0 as p > 0. 


The energy E of the vortex line or cosmic string is given by 
E- f d’x [Vp" - Vo + V(9)]. (3.20) 


Taking cylindrical polar coordinates, the energy per unit length (along the z- 
direction) of a cosmic string of length / is 


E oo 2n a¢* 8d | | 89*8ó 
—= d do —+—— . 
jJ, "f e ap * p! 06 35) 6-21) 


because (3.14) is independent of z. The last term in (3.21) gives a contribution 
to E/I proportional to Jo. f?p^! dp and, because f — | as p > oo, this 
contribution is logarithmically divergent. Considering the energy inside a cylinder 
of radius R and recalling that A7 !/25 sets the length scale, we must get 


E 
T" n(A nR). (3.22) 


This global cosmic string resembles the vortex line in superfluid “He where ¢ is 
the condensate wavefunction. 
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Like a domain wall, a (global) cosmic string is stabilized by topological 
considerations. A cosmic string with asymptotic behaviour e/"? as p — oo is 
said to have winding number n. The space of vacuum states (minima of V) is 
characterized by e/ (as in (3.14)) and so is just a circle 5!. If (for fixed z) we 
draw a circular path of large radius in real space encircling the core of the cosmic 
string (where $ = 0), then, as we go once around this path in real space, the field 
$ goes n times around the circle S! which is the space of vacuum states. Provided 
that the cosmic string is either of infinite length or forms a closed loop, this is a 
property of a cosmic string that cannot be changed by continuous deformations. It 
is a topological quantum number which, at least at the classical level, guarantees 
the continued existence of a vortex line once formed, unless it encounters other 
vortex lines or divides into more vortex lines in such a way that n is conserved 
(e.g. into n vortex lines with unit winding number.) 

If we denote the space of true vacua (minima of V) by M, then the 
topological entity involved is the homotopy group 71(M). In the present case, 
the relevant homotopy group is zr (S !) which is known to be Z, i.e. isomorphic 
to the integers. The winding number n € Z expresses this fact. 


3.4 Local cosmic strings 


If a complex scalar field is coupled to a gauge field, e.g. the electromagnetic field, 
then the Lagrangian possesses a local symmetry, rather than a global symmetry 
as in section 3.3, and a so-called ‘local cosmic string’ [4] or gauge string can 
occur as a solution of the field equations. The simplest example is provided by 
the Higgs model, which is the theory of a complex scalar field coupled to a U(1) 
gauge field which we may take to be the electromagnetic field. The Lagrangian is 
that of section 3.3 amended to incorporate the gauge coupling. Thus, 


L = (D,O)* (D^ 9) — $ Fp FH” — Vio) (3.23) 
with V ($) given by (3.12) and 


Dud = (Ou t ieA,)Ó (3.24) 

Fav = O,Ay — 0A, (3.25) 

where A, is the electromagnetic four-potential and e is the charge of the scalar 

field. This is the same model as that studied in section 2.4, after adding a constant 
to V and with a different definition of A. 


As in the discussion of global cosmic strings, V has a minimum at @ = nef. 
However, the Lagrangian now possesses a local U (1) symmetry under 


o — i^o (3.26) 
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which is broken by the VEV of $. Using cylindrical polar coordinates, we look 
for solutions for @ of the form (3.15), as before, but we must now also determine 
Ax. For large p, with the boundary condition (3.19) so that @ approaches one of 
its minima, . 

$ ~ nein? as p — oo. (3.28) 


If we also arrange that A,, has the boundary condition 
Ay ^ —ie lü,In($/m) ^ asp — oo (3.29) 


then D, $ and Fuy both approach zero for large values of p and the energy density 
vanishes for large p. Indeed, when the complete solution with these boundary 
conditions is constructed numerically, in a way that we shall discuss shortly, the 
energy density approaches zero fast enough as p — oo that this cosmic string has 
finite energy density per unit length. 

The local cosmic string or gauge string carries magnetic flux. The amount 
of flux may be determined by integrating over the area of a circle of large radius 
R in the (p, 9) plane with the asymptotic form (3.29). Then (exercise 5) 


f B -dS = f A -dl =2nne~'. (3.30) 


Thus, the local cosmic string characterized by winding number n carries n units 
of magnetic flux 2ze^!. Local cosmic strings are, therefore, quantized tubes of 
magnetic flux analogous to flux lines in a superconductor. 

To construct the required (static) solution for ġ and A,, we take 


$ = ne"? f (p) (3.31) 
as for the global cosmic string. Then, since 


ad. 186. dg; 
ve =—p+——6+ —k 
$= 355* 5555 3; (3.32) 


the boundary condition (3.29) suggests that we should take A,, to have non-zero 


components only in the 9 and @ directions. Working in a gauge in which the 
component A, = 0, we take 


n ^ 
A = —a(p)ó. (3.33) 
ep 


The functions a(p) and f (p) are then determined numerically by solving the field 
equations 


D,D“ + (6*6 — 17) —0 (3.34) 
dy F"" + ie($* Do — $(D"9)*) 0 (3.35) 


subject to the boundary conditions (3.28) and (3.29). 
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There are now two length scales in the problem instead of one as in the case 
of the global string. There is the mass mg of the scalar field after spontaneous 
spontaneous symmetry breaking which is obtained by substituting ($) in V(@). 
This gives 

m2 = n. (3.36) 
It is mg that controls the rate of variation of $ for large p and, therefore, the 
variation of f (p). There is also the mass m4 of the gauge field after spontaneous 
symmetry breaking which is obtained by substituting (6) in the (D,$)* (D^$) 
term. This gives 

m =en. (3.37) 
It is mą that controls the rate of variation of A for large p and, therefore, the 
variation of a(p). The approximate solution is found to be of the form 


S~- fig exp (- z£) as E — oo (3.38) 
a~1—ayé'/exp(-&) as — oo (3.39) 
where fı and a; are constants and 


£zmap. (3.40) 
It can be seen from (3. 25) and (3.39) that $ is localized on a scale ms ! and A 


is localized on a scale m, "i . As a consequence, the energy density is "localized 
without introducing a cut-off. 

The energy per unit length of a local cosmic string may be estimated as 
follows. The cosmic string has an inner core where $ is approximately zero (i.e. 
a core of false vacuum) with radius 


Rg mS! =A?! (3.41) 
and a tube of magnetic flux of radius 
Ra m4! = egy. (3.42) 


The energy density obtained by putting @ = 0 in V(¢) is Jan. Thus, there is an 
energy density per unit length from the inner core of the cosmic string of order 
5An*n RZ ~ n°. There is also an energy per unit length from the tube of magnetic 
flux of order B? RZ where B is the magnitude of the magnetic field strength. From 
(3.30), with one unit of flux, we estimate 


B~ Rie. (3.43) 


Thus, using (3.42), we find that B? RÈ ~ n? and both the magnetic and inner-core 
contributions to the energy per unit length 4 are of the same order. The total has 
order of magnitude 

un. (3.44) 
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3.5 Gravitational fields of local cosmic strings 


To calculate the gravitational field due to a cosmic string, the energy-momentum 
tensor produced by the string is required. (See section 5 of the review by Vilenkin 
in the general references at the end of this chapter.) For cosmological purposes, 
we are interested in cosmic strings of length much greater than the radius of the 
inner core or the flux tube. We therefore average the energy-momentum tensor 
over the core of the cosmic string and treat the string as having zero radius. Thus, 
for a long straight line with axis along the z-direction, we replace the energy- 
momentum tensor Tyy by Ty» where 


T,» = 6(x)8(y) Í Tav dx dy. (3.45) 
core 


Invariance under Lorentz boosts along the z-direction shows that Too = 733 and 
there are no off-diagonal components. The conservation law for the energy- 
momentum tensor 

D'T,, =0 (3.46) 


must also be imposed; D" is the gravitational covariant derivative. Then, by 
considering f DT; jx* dx dy and integrating by parts, we conclude that 


Tie =O  fori,.k—1,2. (3.47) 
Also, since the total energy per unit length is 4, we can now write 
T,» = n diag(1, 0,0, 1)8(x)8(y) (3.48) 
so that f Too dx dy = p. 

With the energy-momentum tensor (3.48) for the local cosmic string, 
Einstein’s field equations can be solved in the limit Gyu « 1 for the metric 
in the region outside an infinitely long straight string [6]. In cylindrical polar 
coordinates (p, 0, z), the result for the proper-time element dr is 

dr? = dt? — dz? — dp? — (1 — 4G uy? p? do”. (3.49) 
This can be recast as the metric of flat Minkowski space by the transformation 
6 = (1—4Gyp)0. (3.50) 
However, for 0 < @ < 27, we have 


0<6 «2x(1— 4Gyup) (3.51) 


which limits the range of 6. This is called a "conical singularity'. Space with a 
conical singularity is the same as flat space 0 < 6 < 27 with the angular region 


Gravitational fields of local cosmic strings 75 


Figure 3.4. Conical singularity on the space outside a local cosmic string. © is the axis of 
the string. 


between 27 (1 — 4G y u) and 27 removed. In addition, because 0 = O and 0 = 2x 
are to be identified, we must identify 6 = 0 and 0 = 2x — A0 where 


A0 = 81 Gy. (3.52) 


Then, looking along the z-axis, space looks as in figure 3.4 with the hatched area 
removed and points at the same value of p on the dotted lines identified. The 
conical singularity in the space outside a long straight local cosmic string has 
several striking consequences: 


3.5.1 Double images [5] 


A galaxy located behind a local cosmic string, from the perspective of the 
observer, will acquire a double image. Consider, for simplicity, an infinitely 
long straight cosmic string norma! to the plane of the page in figure 3.5 with 
the observer w and the galaxy g being observed in the plane of the page. Because 
points A and B are identified, as discussed earlier, two images of the galaxy are 
seen, emanating from A and B, The angle between the two images Aq is given 
by 

di Aa = d240 (3.53) 


where d and d? are, respectively, the distances from the observer to the galaxy 
and from the cosmic string to the galaxy. As a consequence of (3.52), 


d 
Aa = — 8x Gui. (3.54) 
1 
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Figure 3.5. Double image of a galaxy behind a cosmic string. © is the axis of the string, 
w is the observer and g is the galaxy. 


3.5.2 Temperature discontinuities [6] 


Consider a local cosmic string moving perpendicularly to the line of sight of an 
Observer observing the cosmic microwave background radiation coming from far 
off (dy — d? in the previous discussion). There are two images P; and P2 of 
the same point separated (see figure 3.6) by an angle Aa ~ 8x Gyg. If the 
relative velocity of the cosmic string and observer is v, then P, and P5 have a 
component of velocity of order Aa antiparallel or parallel to v respectively. As a 
consequence, there is a Doppler shift in the temperature of the radiation between 
the two points. This results in a discontinuity ôT /T from one side of the cosmic 
string to the other of order 8x G y |v|. 


3.5.3 Cosmic string wakes [7] 


A long straight cosmic string moving with velocity v across the universe will 
deflect particles of matter. A wedge of matter with opening angle 87 Gyu and 
radius vt forms as a wake in time t. This may be relevant to structure formation. 


3.6 Dynamics of local cosmic strings 


Once a network of cosmic strings has formed, following a phase transition in the 
early universe, its evolution depends on the emission of gravitational radiation by 
string loops. In principle, the classical field equations (3.34) and (3.35) provide 
the equations of motion for a local cosmic string. In practice, when we are 
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Figure 3.6. Temperature discontinuity due to a cosmic string. © is the axis of the string, 
w is the observer and P, and P5 are two images of the same point. 


— Tt 


Figure 3.7. World sheet for a cosmic string. 


neglecting the radius of the string core, it is much simpler to use the equations of 
motion for a relativistic string of zero radius which derive from the Nambu—Goto 
action and are identical to the equations of motion for the fundamental bosonic 
string. 

We denote by X^ the position in spacetime of points on the axis of the 
cosmic string (where the Higgs field $ is zero.) Whereas a point particle may be 
described by degrees of freedom X^ (t) depending only on a timelike coordinate 
t, to describe a string we need, in addition, a spacelike coordinate o. which we 
may take, for convenience, to be in the range 0 < o < n. Then, the string degrees 
of freedom X^ (t, a) trace out a curve (see figure 3.7) as o varies at fixed r. The 
action S for a relativistic string propagating in Minkowski spacetime is of the 
form 


T ty n 
S=-3 f dr Í do (— det h) "2598 5,8, X" 3g X" (3.55) 
ti 
where T is the string tension, 


nuv = diag(1, —1,..., —1) (3.56) 
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and hyg(t, a) is a world-sheet metric of signature (+,—) where a = 0 and 
] refer to r and ø respectively. This action displays two-dimensional world- 
sheet reparametrization invariance and also possesses conformal invariance under 
a local rescaling of the world-sheet metric 


óhag = A(t, o)hagp | óX" =0. (3.57) 


With the aid of world-sheet reparametrization invariance, the world-sheet metric 
may be reduced to the form 


hap(t,0) = er Ona, (3.58) 
where 
Nop = diag(1, —1). (3.59) 
With the aid of conformal invariance, it may be further reduced to 
hap = Nap. (3.60) 


The gauges (3.60) are referred to as ‘covariant’ gauges. There is still further gauge 
freedom which we shal! exploit shortly. 

In a covariant gauge, the equations of motion of the string, obtained by 
varying with respect to X^ and hag (exercise 6), take the simple form 


0,9" X" = p s X" —0 (3.61) 
* BUE i ` 


with the constraints 
OX" 0X, | 0X" aX, 


——— = 0 ` 
ðt OT + ða ða (3.62) 
9X" 0X, 

—— —— cQ. . 
dt ðo (3.63) 


For a closed string loop, there is the boundary condition 
X" (t,o +2) = X*(1,0). (3.64) 


The remaining gauge degrees of freedom may be used to choose the ‘temporal’ 
gauge in which r is identified with X? = t (Minkowski time.) Then, the equations 
of motion and constraints become 


a op 
(s — =) X20 (3.65) 
aX ax 
D ao T? (3.66) 


ax\? fax v 
(a) (35) =l. (3.67) 


Dynamics of local cosmic strings 79 


IN XX 


Figure 3.8. Intercommuting cosmic strings. 
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Figure 3.9. Cosmic string loop intercommuting with itself. 


These equations have oscillatory solutions and these will allow string loops to 
radiate. Because the string loops are relativistic, the quadrupole formula for 
gravitational radiation cannot be used. A relativistic calculation shows that the 
power P emitted in gravitational radiation by a string loop is given by 


P -yGyu? (3.68) 


where Gy is Newton's constant and y is a number of order 100 which depends 
on the particular loop [8]. As in section 3.5, yz is the energy per unit length of the 
cosmic string. 

This gravitational radiation is important for the development in time of the 
network of cosmic strings that formed at a phase transition. Also important is 
the process of intercommuting [1,9] as in figure 3.8. In particular, a string loop 
may intercommute with itself as in figure 3.9 to produce two smaller loops. When 
the evolution of a string network is studied [10], allowing for these two effects, 
it is found that strings will not dominate the present day energy density of the 
universe. However, apart from individual relic strings producing the observable 
effects discussed earlier, the evolution of the string network will leave a relic 
gravitational wave background as a result of gravitational radiation emission by 
oscillating string loops. Since the gravitational emission is controlled by Gyu’, 
this will set a limit on u if this gravitational background is not to undo the 
predictions of the standard model for nucleosynthesis. This is found to require 
[Il] Gnu S 1075. There is, however, a tighter bound [12] set by the magnitude 
of the cosmic microwave background fluctuation of Gyu S 1076. It is possible 
that particle production rather than gravitational wave emission dominates the 
energy loss from oscillating cosmic string loops. In that case (13], there is an 
even tighter bound Gyu € 1079. 
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3.7 Magnetic monopoles 


It is also possible for point topological defects, magnetic monopoles [14], to form 
at phase transitions in the early universe. The simplest model exhibiting this is 
an $O(3) gauge field theory with 5O(3) spontaneously broken to U(1) by the 
expectation value of a scalar field ¢ in the three-dimensional representation of 
SO(3). The Lagrangian density for this model is 


l a 
L= D$: D'$- z FiF} — 5-6 - my (3.69) 


where the gauge field strength is 


Fi, = 04 AS — bv At, — Beabc Ap Ap- (3.70) 


The covariant derivative of the scalar field is 
Duba = dupa — 8€abc A, óc (3.71) 


and a, b, c take the values 1, 2, 3. Minimization of the effective potential 
À 
V-g9-$-n» (3.72) 


fixes 
l$l = n. (3.73) 


However, because of the SO(3) symmetry, the direction of ¢ is not fixed. 
The magnetic monopole solution [16] is a spherically symmetric solution for 
¢ of the form 
$ — nf (rr (3.74) 


which is a mapping from ordinary space to the SO(3) space, with the asymptotic 
behaviour 
o~nr asr — oo (3.75) 


The spatial variation of $ will be determined by the covariant derivative D,,$ 
and so gn must enter the r-dependence. On dimensional grounds, we can write, 
without loss of generality, 


H ^ 
= a (3.76) 
where 
E = gnr. (3.77) 
The required behaviour as r — oo is obtained if 
H 
HG) >] as E > oc. (3.78) 


d 
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In the absence of gauge fields, the contribution of the scalar kinetic term to the 
energy is given by 


E= TEEI (3.79) 


where i = 1,2,3 is a (summed) spatial index. In spherical polar coordinates 
(r. 6, 9), 26 | a 1 a% 
a^ aa a4 
0 . 3.80 
or r 980 + ind 26 * ( ) 
Since Ê is a function of 0 and @, but not of r, the large r behaviour of the integral 
(3.79) for the energy of the monopole is controlled by 


oV 1 far 


Thus, in the absence of a gauge field contribution to E, the energy of the monopole 
solution would be infinite. 

To find a finite-energy solution, we need the gauge field contribution to the 
covariant derivative to produce a cancellation to ‘improve’ the behaviour of V ġa 
for large r. This possibility may be studied by making the following ansatz for 
the gauge field expectation value: 


Ag = HU ee) — 1). (3.82) 
gr 


Then (exercise 7) 
_ K(&)H (5) 
gr* 


Fafi 


Diba = (rà = rari) + CH'E) - HEDEF — 89 
A dangerous term of the type H (Er? ai /gr* has cancelled between the 3j$; and 
—g€apc AP $c contributions to Diġa. For FHG) — | as & — oo, this term would 
have had the unwelcome asymptotic behaviour r^! as r — oo. The surviving 
terms are of order r7? as r — oo, provided K (£) H (£) and £H'(£) — H (£) are 
finite for E — oo, and a divergent contribution to the energy of the monopole is 
avoided. 

With the ansatz (3.82) for the expectation value of the gauge field, the field 
strength is given by 

K? -| K' K?-| 
BF; = —H fia + (5 — E (€iapr pr j — € jap? pri). (3.84) 


The energy of the magnetic monopole solution may now be written in terms of H 
and K as 


Lm 
8 
l2 yd yp Ày 22 
+ 5(K - 1) taz” -ny |. (3.86) 


co 
E Í aelem - MP+ HK? + ex (3.85) 
0 
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Minimizing with respect to variation of H and K gives (exercise 8) 
EK" = KH? + K(K? — 1) (3.87) 
6?H" =2K7H + sh? — §%), (3.88) 


There is an analytic solution (15] in the limit A/g? -— 0, 


H(€) = £coth£ — 1 (3.89) 
K (E) = E coseché. (3.90) 


Note that K (£) H (£) and £ H'(£) — H (£) are finite as £ — oo, as required earlier 
to avoid a divergent contribution to the energy of the monopole solution. In fact, 


K(E)H(£) 5 0 as£ — oo (3.91) 


in this limit. The corresponding energy, which is (at least at the classical level) 
the mass my of the monopole, is given by 


HM = —. (3.92) 


More generally, it has the form 
4 A 
my = T^ ( ) (3.93) 


where h turns out to be a slowly varying function. 
The monopole solution carries a magnetic charge, which we now evaluate in 
the limit A/g? — 0. The solution (3.90) implies that 


K(£) —0 as E — oo. (3.94) 


Thus, as £ — oo, (3.82) reduces to 


a_ _ failli 
i 77 gr? 


as£ > oo. (3.95) 


We shall refer to 
Bj = (V x A*)i = $e Fi (3.96) 


as the *magnetic' field, though we are not dealing with electroweak theory here 
because the gauge group is SO(3) rather than SU(2) x U(1). With the gauge 
field expectation value given by (3.95), the magnetic field is (exercise 9) 


BY = 


i — gre (3.97) 
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The surviving U(1) gauge group after spontaneous symmetry breaking by the 
scalar field expectation value (3.74) is the group of rotations about the direction 
F. The magnetic field should, therefore, be identified with the component B of 
B" in this direction: . 

F 
gr? 
Thus, the magnetic field is the field of a magnetic monopole with magnetic charge 
Anje. 


B= (3.98) 


3.8 Monopole topological quantum number 


The asymptotic form ¢ = nÊ of the magnetic monopole cannot be deformed 
continuously to the trivial configuration @ = n2 and, for this reason, the magnetic 
monopole is topologically conserved once formed. This is reflected in the 
existence of a topological quantum number defined as follows. The sphere M 
of all solutions for the expectation value of œ which minimizes the tree-level 
effective potential is given by 

Ió| = n. (3.99) 


This defines the surface of a sphere and $(7) can be thought of as a mapping from 
the surface X of a sphere in coordinate space to the surface M of the sphere in 
the space of solutions. If we parametrize the surface X by parameters u and v, 
then the element of surface on the space M is 


dS = dSn = E: x æ) du dv. (3.100) 
ðu ðv 


The unit normai n to the sphere is 7? and so 


TEE x) du dv = ndS. (3.101) 
ðu ðv 


As the parameters u and v are varied to allow F to sweep out the surface X of a 
sphere in coordinate space, @ sweeps out the surface M of a sphere in the space ` 
of solutions. For the single monopole solution that we are discussing, as the first 
sphere is swept out once so is the second sphere. More generally, for a multi- 
monopole solution, it is possible for the second sphere to be swept out N times, 
where N is an integer, as the first sphere is swept out once. Thus, 


f $- E: x *) du dv = Anm N. (3.102) 
E du ðv 


If we continuously deform ¢, N cannot change because it is an integer, one in the 
case of the single-monopole solution. It is a topological quantum number, which 
reflects the fact that a magnetic monopole configuration once formed cannot be 
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continuously deformed to the trivial configuration. Thus, magnetic monopole 
configurations are stabilized in a topological way. 

The topological quantum number N is related to the magnetic charge. This 
can be demonstrated as follows. Denote the elements of surface and the unit 
normal on the sphere in coordinate space by dS and à. Then 


- dra 
dā = (= x x) du dy (3.103) 
ðu ðv 
from which it follows that 
~~ a(r;, rk) 
eijk dŠ ñi = 05 ¥ dv. (3.104) 


Noting that 

0$ 3$ 1/86 I Or; re) 

kK = slo * = | SO l 

du av 2 (s * are) Olu, v) (3.105) 
and using (3.103), the topological quantum number N may be recast in terms of 
coordinate-space derivatives of $ as 


Ang N- ; Í eijtñiġ - ($ x x) d$. (3.106) 
The integral (3.106) may be formulated as a surface integral of $- F jy on a sphere 
= of large radius, where the F "m are the spatial components of the gauge field 
strength defined in (3.70). For this purpose, we need a solution for F jg in terms 
of $ valid for large r. As discussed in section 3.7, for a finite-energy solution 
there is a cancellation between the two terms in the covariant derivative 


Did = ði — (Ai x $) (3.107) 


such that the covariant derivative is of order r~? forr — oo, whereas, separately, 
the two terms are of order r^! . Thus, for large r, 


9;$ = g(Ai x $) (3.108) 
from which it follows that 
1 1 
Ai = — ($ x 016) + Said (3.109) 
8n n 
where 
a; = @- Aj. (3.110) 


The corresponding expression for $ - F jy obtained from (3.70) is 


1 
$.Fj- an? : (8j6 x Ip) + (Aja, — kaj). (3.111) 
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Combining (3.106) with (3.111), the topological quantum number N may be 
written in terms of the field strength as 


gn? c 
Ang N = E. f ene Fp dS. (3.112) 


Finally, N may be recast as a surface integral of the magnetic field on a sphere 
X of large radius. As discussed in section 3.7, the magnetic field in the model 
with SO(3) gauge group should be identified with the component B? in (3.96) 
along the direction about which the surviving U (1) gauge symmetry is the group 
of rotations. In general, this is the direction |$| ^! $. Consequently, the magnetic 
field B is given by 


1 
Bi = 2,09 » F jx. (3.1 13) 
It follows from (3.112) that 
g MEE" 
v= [ n.aas. (3.114) 
4z Jy 


Thus, the topological quantum number N measures the magnetic charge in units 
of 4x /g. 


3.9 Magnetic monopoles in grand unified theories 


In general, if we start with a grand unified group G and the symmetry is 
spontaneously broken to H (at a phase transition), then the action of any element 
of H leaves a vacuum state invariant. Consequently, distinct vacuum states 
correspond to the coset manifold G/H. The logic behind this is that G invariance 
of the effective potential means that starting from any vacuum state, we can 
generate further vacuum states (degenerate in energy) by acting with elements 
of G. However, when the element of G in question is an element of the subgroup 
H, it does not produce a new vacuum state. The topological entity underlying 
the existence of stable magnetic monopoles is the second homotopy group for 
G/H denoted by z2(G/ H), whose elements are inequivalent mappings from the 
surface of a two-sphere 5? to G/H, i.e. mappings which cannot be continuously 
deformed into each other. There is a theorem that z2(G/ H ) can be identified with 
7 (H)/71(G). Here, x(G) is the first homotopy group of G whose elements are 
inequivalent mappings from a circle $! to G and similarly for H. 

In the example just considered, G = SO(3) and H = U(l) Also 
m(U(1)) = Z, the integers, with the value of the integer being the winding 
number, i.e. the number of times we wind around the circle defined by U (1) as we 
wind once around the circle in coordinate space. Less obviously, z1($0(3)) = 
Z2, the integers modulo 2. In this case, therefore, z12(G/ H) = Z/Z2 or the even 
integers. This is why we found magnetic charges in multiples of 4zx/g which is 
twice the Dirac magnetic monopole charge. 
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When H is SU(3) x SU(2) x U(1), which will be the case for a phase 
transition in which the grand unified theory breaks spontaneously to the standard 
model, then x; (H) is just 71(U(1)) = Z, because 7,(SU(3)) and m(SU (2)) 
are both trivial. Thus, for such a spontaneous symmetry breaking, the resulting 
second homotopy group is Z/m1(G). In particular, if x(G) is trivial, as is the 
case for the SU (5) grand unified group discussed in section 2.6, then 72(G/H) = 
Z and we have magnetic monopole solutions. 

We now ask what masses are possessed by the magnetic monopoles in grand 
unified theories. By analogy with (3.92), the magnetic monopole mass will be of 
order 42 n/2G, where n is the expectation value of the Higgs scalar responsible for 
breaking the grand unified symmetry and gg is the value of the gauge coupling 
constant for the grand unified group at the unification scale. In the case of the 
SU (5) grand unified theory of section 2.6, n, which is identified with $c, is of 
order 10? GeV and gg is of order 1. Thus, we expect the magnetic monopole 
mass m m to be of order 10!6 GeV. In the case of the supersymmetric SU (5) grand 
unified theory of section 2.7, with a unification scale of 2 x 10!6 GeV, which is 
1.5 orders of magnitude greater than in the non-supersymmetric case, a magnetic 
monopole mass of order 10!7-10!8 GeV is to be expected. 


3.10 Abundance of magnetic monopoles 


Magnetic monopoles form as the phase transition from the SU(5) symmetric 
phase to the standard model SU(3) x SU(2) x U(1) phase occurs. This is 
the result of the expectation values of the Higgs field only being correlated over 
some finite distance. The expectation values of the Higgs field at different points 
in space will not be aligned to produce a uniform Higgs field over distances 
greater than this. Thus, we can expect topologically non-trivial configurations 
to be produced, in particular, magnetic monopoles. The number of magnetic 
monopoles formed [16, 17] should be determined as to order of magnitude by 
the distance over which the Higgs expectation values are correlated [1,9]. 

There are two effects which can limit the range over which this correlation 
occurs. The first is the statistical-mechanical thermal average over the product 
of the two Higgs fields. For a second-order phase transition, this correlation 
length is of order 7; ! but can be larger for a first-order phase transition, which 
proceeds through the formation of bubbles of the low-temperature phase which 
then coalesce. The second effect is the general-relativistic particle horizon dy. 
Correlations cannot occur over distances greater than the distance dy that light 
has been able to travel since the big bang. For a Friedman-Robertson- Walker 
(FRW) universe, as discussed in section 1.2, the proper distance at time t from 
any point to the particle horizon is 


dr’ 


RO (3.115) 


t 
dy(t) = no | 
0 
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Barring cosmological inflation (to be discussed in later chapters), the growth of 
R(t) with time is according to the power law 


R(t) ^ tr" (3.116) 


(with n = 5 for a radiation-dominated universe). Then 
f 
dy(t) = —— (3.117) 
l-n 


provided n # 0. Thus, dg (t) is of order t. We require the particle horizon at 
time tc that the phase transition is completed. A slightly different discussion is 
required for second-order (or weakly first-order) and first-order phase transitions. 

For a second-order phase transition, the phase transition is completed at the 
critical temperature 7,. For the radiation-dominated era of the FRW universe, 
there is the connection (see section 1.3) between the time t since the big bang and 


temperature T: 


~ -1/2MP 
(= 0.3N, T? 


where m p is the Planck mass (~10!9 GeV) and N, is the effective number of 
degrees of freedom at temperature T: 


(3.118) 


N, = Ng t &Nr. (3.119) 


Ng and Np are, respectively, the numbers of bosonic and fermionic degrees of 
freedom for particles with mass small compared to T, in the sense described after 
(2.19). For approximately one monopole per horizon volume, the number density 
n M of monopoles should be 
nu (Te) ~ (du (1) 2 ~ 123 ~ Ne? Tm 5(0.6)? (3.120) 
where we have taken 
di (tc) = 2tc (3.121) 


for the radiation-dominated era. If we compare this with the entropy density of 
(2.21), 


2m? 3 
- J 
s= TMT (3.122) 
then (Ta) 
NM\ic 1/273, -3 
IMEL ~ 10.6NL? 7353. 123 
s(T.) * emp 3.123) 


At temperatures below T. but above the electroweak phase transition, the 
appropriate value of N, is that for the SU(3) x SU(2) x U(1) standard model: 


Ne = 106.75. (3.124) 
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s(Te) mp 


Assuming that the expansion of the universe for T < Te is adiabatic, then 
s o R^ and the ratio nM(T)/s(T) does not change. As a consequence, the 
monopole contribution Rmh? to Rmh? today is predicted to be many orders 
of magnitude greater than the observational bound of about 0.15. For a non- 
supersymmetric GUT theory with T; of order 105 GeV and a magnetic monopole 
mass my of order 1076 GeV, QA? is 14 orders of magnitude greater than this 
upper bound. For a supersymmetric GUT theory with T+ of order 10 GeV and 
a magnetic monopole mass my of order 10!7—10!8 GeV, the situation is even 
worse with Rmh? some 18-19 orders of magnitude greater than the upper bound 
(exercise 10). 

In the case of a first-order phase transition, the transition does not proceed 
until some temperature below T, at which the bubble nucleation rate for bubbles 
of the low-temperature phase is of the same order as the expansion rate H for 
the universe. We expect the Higgs expectation values to be correlated within 
a bubble but uncorrelated between any two bubbles. Thus, the number density 
of monopoles (or antimonopoles) produced should be of the order of (nri), 
where rj is the average radius of a bubble at a time when the bubbles have 
expanded to just fill the whole of space. The universe supercools at the first- 
order phase transition but reheats when the bubbles coalesce, so that the entropy 
density after reheating is 22 N, 73/45, as in the second-order case. Thus, for a 
first-order phase transition, 


3 
nm(Te) ag? (=) (3.125) 


4 4 =i 
nM 45 Naro (frre). (3.126) 
c 3 b 


The value of r, has been estimated [18] leading to a value of Ruh? even larger 
than in the second-order phase transition case. 

In either case, if magnetic monopoles form at a grand unified phase 
transition, some mechanism is required to dilute the monopole density by many 
orders of magnitude. The most obvious mechanism would be annihilation of 
monopoles and antimonopoles. However, this has been estimated [16] and there 
is no significant effect for nm/s S 10719 and, for larger values of nm/s, 
the annihilation process cannot reduce nm/s much below 107! For the non- 
supersymmetric case, this mechanism is ineffective and, for the supersymmetric 
case, it can do no more than reduce the monopole abundance closer to that for 
the non-supersymmetric case. A possible mechanism that can do the trick is 
cosmological inflation, which will be discussed in later chapters. 

After some mechanism has reduced the monopole abundance to a value 
compatible with the bound on Q,,h?, any residual monopole density can have 
important astrophysical consequences [19]. For instance, because the expectation 
value of the grand unified Higgs field approaches zero as the centre of the 
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monopole is approached, the SU(5) grand unified symmetry is essentially 
unbroken in the core of the monopole. Consequently, when a nucleon encounters 
a magnetic monopole, decay of the nucleon can be induced by the baryon-number 
non-conserving interactions of the lepto-quark fields of the SU (5) grand unified 
theory. In this way, the magnetic monopoles collected by stars in the course of 
time will cause emission of radiation from neutron stars. This puts severe limits 
on the flux of magnetic monopoles. More details of this and other astrophysical 
effects may be found elsewhere [19]. 


3.11 Exercises 


1. Calculate the energy of a kink or antikink soliton solution. 

2. Compare the total energy associated with a plane domain wall with n = 
100 GeV and area Hy ? with the known mass of the universe within a Hubble 
volume Hy 3, 

3. Derive equation (3.16) for the dependence of a global cosmic string on the 
cylindrical polar coordinate p. 

4. Check that the Lagrangian (3.23) possesses the local U (1) gauge symmetry 
(3.26) and (3.27). 

5. Show that the magnetic flux carried by a local cosmic string is given by 
(3.30). 

6. Derive the string equation of motion (3.61) and the constraints (3.62) and 

(3.63) by varying the action (3.55) with respect to X^ and hag. 

Derive the covariant derivative (3.83) for a monopole solution. 

Derive equations (3.87) and (3.88) for the form of a monopole solution. 

Derive equation (3.96) for the magnetic field due to a magnetic monopole. 

0. Estimate the monopole contribution to Qh? in non-supersymmetric and 

supersymmetric grand unified theories. 


= om 


3.12 General references 


The books and review articles that we have found most useful in preparing this 
chapter are: 


e Hindmarsh M B and Kibble T W B 1995 Rep. Prog. Phys. 58 477 

e  Vilenkin A 1985 Phys. Rep. 121 263 

e Kolb E W and Turner M S 1990 The Early Universe (Reading, MA: 
Addison-Wesley) 
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Chapter 4 


Baryogenesis 


4.1 Introduction 


The success of the standard model in describing the fundamental interactions has 
the consequence, among many others, of verifying the TCP invariance of nature. 
This requires that, for each particle X having mass mx, decay width l'x and 
quantum numbers Qx etc, there is an antiparticle X with the same mass and 
width, mg = mx, Tę = Ix but with opposite quantum numbers Qg = —Qx 
etc. One might, therefore, suppose that the world we inhabit would share this 
symmetry and contain equal numbers Nx of particles and antiparticles Ny = Nx. 
This is clearly not the case. We know that the solar system is made of matter 
(protons, neutrons, electrons) and not antimatter, and the experimental bound on 
antihelium is [1] 

THe L3. x 1075 at 95% CL. (4.1) 

"4pe 
Any region of antimatter must be well separated from regions of matter, since, 
in any region where protons and antiprotons coexisted, their annihilation into 
pions with the subsequent 79 — 2y decays would significantly distort the 
cosmic microwave background. The data require that such domains of matter 
and antimatter are separated by a length scale /g with, conservatively, 


lg 2 3 kpc (4.2) 
the radius of our galaxy, and probably [2, 3] 
lg = 10 kpc (4.3) 
the scale of the Virgo cluster. » 
The asymmetry between baryons (b) and antibaryons (b) may be quantified 
by the difference in their number densities ng = np — ng. However, the expansion 


of the universe dilutes both n, and ng and, hence, their difference, since, as 
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explained in section 1.4, each scales as R(t)73, where R(t) is the cosmological 
scale factor. It is, therefore, customary to use the ratio 


n= — (4.4) 
Y 


to measure the asymmetry. ny is the photon number density given by the 
Boltzmann distribution (see section 5.1) 


ny = 27T (4.5) 


when the temperature is T. From the measured microwave background [4], 
T = To = 2.725 + 0.002 K at present and this gives 


ny © 411 cm^?, (4.6) 


The present net baryon number density may be written in terms of the current 
critical density, defined in (1.37), 


3 
m2, Hà = 1.885? x 1077 g cm? (4.7) 


Dc = 8x? 


where h = Ho/100 km s^! Mpc^! measures the present Hubble constant, 
mp = Gy = 1.22 x 101? GeV is the Planck mass and (5] 


h = 0.0.7170-03. (4.8) 
Then 
ng = D = 1.1 x 10 5058? cm? (4.9) 
ms 
and 
n = 2.65 x 10 *055? (4.10) 


where 25 = pg/p: measures the baryon energy density as a fraction of the 
critical density. The measured primordial deuterium and hydrogen abundances 
require [5] 

Qgh? = 0.024 + 0.001 (4.11) 


which gives 
n = (6.36 + 0.26) x 107". (4.12) 


The conservation of entropy in a comoving volume, when the universe is in 
local thermal equilibrium, means that the entropy density s also scales as R(t)~>. 
Thus, the baryon asymmetry may alternatively be measured by 


ne = — (4.13) 
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where 2 
2x 
s= ay Best (4.14) 
with 3 3 
Ti 7 Ti 
S«S.T = D gi (2) + 8 > gi (2) (4.15) 


bosons fermions 


counting the total effective number of massless degrees of freedom at the 
temperature T, g; = | for a real scalar, g; = 2 for a real (massless) gauge 
field, g; = 4 fora spin-4 Dirac field and g; — 2 for a Weyl (chiral) field. We are 
allowing the possibility that different species are at different temperatures. When 
all T; = T, g,s, = N+ given in equation (1.104), and (4.14) reduces to (2.21). 
This is an excellent approximation until t ^ 1 s (or T ^ 1 MeV). However, as 
noted in section 1.8, it is not true today. The advantage of using ng as a measure of 
the baryon asymmetry is that it is conserved, as long as baryon-number-violating 
interactions occur very slowly. The relationship between s and ny is 


4 
m 
s = —-2, ~ 1.8 n 4.16 
45¢(3)° S,7My 8aS,TNy ( ) 
so 
n= 1.8gss,rne. (4.17) 


Thus, 5 is not constant in time, since g.s,r changes as the temperature drops 
and the number of effective massless modes decreases. The present entropy 
So = 7.0394n, 9 and the same data (4.12) give 


ng = (9.03 + 0.37) x 107. (4.18) 


So the challenge confronting theorists is to explain this small, non-zero 
number. The natural assumption is that ‘originally’ there was zero asymmetry. In 
equilibrium at a temperature T < | GeV, the nucleon and antinucleon densities 
are 


T 3/2 
ny = my 2 ("E ) emn/T (4.19) 


As the universe cools, the nucleons and antinucleons annihilate with a rate 
Pann = Ny (Gannv) (4.20) 


where (...) denotes thermal averaging, dann is the annihilation cross section and 
v is the relative velocity. The annihilation continues so long as the rate is larger 
than the expansion rate H of the universe: 


He 8p Pon 84,7 \'/2 T? 421 
(25) FC wa 


3m’, 3 5 mp 
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assuming that the energy density p = (x2/30)g+,7T‘ is dominated by relativistic 
particles. Here 


nY 1 T: \ 
ger = >> gi (£) +3 Ð gi (i) (4.22) 
bosons fermions 


satisfies g,, r = g«s,r = Ne (with N, defined in (1.104)) when all particle species 
i are at the same temperature 7; = T. (See section 5.1.) The thermal average 


(Cann v) ~ mz? (4.23) 


and, at T = T, ~ 20 MeV, the annihilation rate falls below the expansion rate, 
nucleons and antinucleons are so dilute that they cannot annihilate further and 
their number densities become frozen at 


nw ny 2n) (ay _my/Ty 
== e (4.24) 
ny, ny (3) \2nTyz 

~ 19718 (4.25) 


using (4.6). This is far smaller than the value (4.12) which derives from the 
measured primordial abundances of the light nuclei. Thus, the assumed zero 
initial asymmetry is inconsistent with the nucleosynthesis data. 
Of course, statistical fluctuations can generate a non-zero initial asymmetry. 
At present our galaxy contains about 107? photons and 10 nucleons. When 
T 2 | GeV, however, the comoving volume containing our galaxy contained 
about 107? baryons and antibaryons. Thus, statistical fluctuations might generate 
an asymmetry 
Ng — Ng - /N (4.26) 


so that, instead of (4.19), we have 


loo 19-395 
nn — hy JUN 10 nN (4.27) 
which again is far too small to explain the nucleosynthesis data. 

The conclusion is that the initial baryon asymmetry must be non-zero to 
explain the size of the asymmetry we observe today. Of course, the required value 
may be input by hand as an initial condition but aesthetically this is unattractive. 
The consensus is that the asymmetry derives from new physics in the early 
universe. We turn next to the three necessary conditions for baryogenesis, first 
derived by Sakharov [6]. 


4.2 Conditions for baryogenesis 


If we start from a universe with a net baryon number B of zero and evolve to 
one with a non-zero value, it is clear that baryon number is not conserved. Thus 
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the first condition is that there are baryon-number non-conserving interactions 
in nature. Aside from the baryon-number asymmetry itself, there is no direct 
experimental evidence of such interactions and any theory which contains them 
is constrained by the current lower bound on the lifetime (rp) of the proton [7] 


tp > 1071-1093 yr. (4.28) 


The generation of a non-zero baryon number for the universe, or 
*baryogenesis', also requires that there are C- and CP-violating interactions in 
nature. To see this, suppose that a processi — f, with initial state i and final state 
f. violates baryon-number conservation, so B; — By # 0. If charge conjugation 
C were an exact symmetry, then the process | — f, where is obtained from i by 
replacing all particles by their antiparticles and similarly for f, would occur at the 
same rate as the former. Since B; = — B; and B; = — Bp, the net baryon number 
produced by the two processes Bj — Bf + B; + B f is zero. À similar argument 
applies if CP-invariance were exact; parity reversal P reverses the momenta of all 
participating particles but when these are integrated over the (identical) allowed 
phase space the net baryon number produced by the two processes is again zero. 
The TCP-invariance of any particle physics model (T is time reversal) ensures that 
if there is CP-violation, then there is also T-violation and it is easy to see that if 
T-invariance were an exact symmetry, then baryon number would be conserved. 
Of course, we have long known that C-invariance is violated by weak interactions 
and that CP-violation occurs at the milliweak level in kaon decays [8]. There is 
also limited evidence for T-violation in kaon decays [9]. Thus there is no a priori 
need for new physics from this condition. 

The final condition is that the baryon-number non-conserving processes 
occur when the universe is not in thermal equilibrium. To see this, consider a 
particle X with non-zero baryon number in thermal equilibrium at a temperature 
T « myx. The number density nx of X particles is given by 


nx ~ gx(mx T) e mx*uT (4.29) 


where ux is the chemical potential. Likewise, in thermal equilibrium the number 
density n ; of the antiparticles X is 


ng ~ gx(nx T 2e mera (4.30) 


If X, X participate in baryon-number non-conserving processes, as required by 
the first condition, then the process 


XX > XX (4.31) 
is allowed and, in equilibrium, this requires 


2ux = 2uy. (4.32) 
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Then the asymmetry vanishes, since 
nXxnx —ny = 0. (4.33) 


Only a departure from thermal equilibrium will permit an asymmetry. Such 
a departure can arise, for example, during a phase transition in which gauge 
symmetry breaking occurs. It can also arise due to the expansion of the universe 
during the decay of a heavy particle. 


4.3 Out-of-equilibrium decay of heavy particles 


Given enough time any particle, no matter how weakly it interacts, will reach 
thermal equilibrium. However, in an expanding universe, it becomes increasingly 
difficult for any given species X of particle to remain in thermal equilibrium. 
This is because the expansion dilutes the densities of all particles with which 
X interacts and thereby inhibits the rate of the interactions needed to maintain 
equilibrium; and also because the rate of decay Fx of the X particles eventually 
falls below the expansion rate H of the universe and the decays are unable to 
reduce the numbers of X particles to the levels required to stay in equilibrium. 

Suppose X is a superheavy boson, having non-zero baryon number, which 
decays to lighter fermions f in a baryon-number non-conserving process. Gauge 
vector bosons and Higgs scalar particles with these properties arise naturally 
in grand unified theories (GUTs) which unify the strong and electroweak 
interactions (10, 11]. At high temperatures T >> mx, we assume that all particles 
are in equilibrium, and that the net baryon number B is zero. The number 
densities of the X particles and their antiparticles are 


3 
nx =ng= eo xT? (4.34) 


with gx = 2,1 corresponding respectively to X being a vector, scalar particle. 
Thus, using (4.5), 


= — = =gy. (4.35) 


To maintain equilibrium densities as the universe expands, the X and X particles 
must reduce their numbers sufficiently and, so long as they are able to do so, the 
net baryon number remains zero. For T S m x, the equilibrium densities will then 
reduce relative to that of the photons. From the analogue for X particles of (4.19) 
and (4.5), the relative densities are given by 

ng "E xm ( mx J emur, (4.36) 


ny ny, tQ) \2nT 


For baryogenesis, the most important quantity in determining whether thermal 
equilibrium can be maintained when T ^ mx is the decay rate [x of the 
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X particle, which controls the numbers of X and X particles. Unless this is 
sufficiently rapid, thermal equilibrium densities cannot be maintained as T falls. 
If at some temperature T > m y the X bosons cannot decay in the time scale H ^d 
associated with the expansion of the universe, then they decouple from the thermal 
bath while they are still relativistic and their densities satisfy (4.35). Thus at a 
lower temperature T € m x, their abundance is much larger than the equilibrium 
densities satisfying (4.36). The condition for this to happen is that 


Tx € Hlr-my - (4.37) 
Using (4.21), this requires 
3/572 
miz x (>) N, P prymp (4.38) 


assuming that all (relativistic) particle species are at the same temperature, so 
&x,7T = Ns. The overabundance which occurs when this condition is satisfied 
allows the possibility of baryogenesis. Whether or not it is satisfied depends upon 
the particular GUT model in which the X particles arise. 

If X is a superheavy gauge boson, for example, 


ry ~agmy (4.39) 
where ag = 8 /4n is the GUT ‘fine structure constant’. Then (4.38) gives 
-i/2 
mx 2 N, ““agmp. (4.40) 


For the non-supersymmetric SU(5) GUT (which, incidentally, does not satisfy 
the constraint (4.28) on the proton lifetime), 


ag~ d  mx-105Gev N= 22 (4.41) 


provided that the colour triplet, electroweak Higgs particles are superheavy, the 
constraint (4.40) is not obviously satisfied. However, for the supersymmetric 
SU (5) GUT, 


ac~} —mx-2x1l06Gev — N, 292 (4.42) 


the larger values of N, and mx outweigh the larger value of ag and the non- 
equilibrium condition (4.38) is marginally satisfied. 
However, if X is a superheavy Higgs particle, its decay width 


mf 2 
Ty ^ (=<) agmx (4.43) 
mw 


can be much smaller than that of the superheavy gauge boson, because the Yukawa 
coupling is suppressed (unless f is a top quark) by a factor m y / m w relative to the 
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gauge coupling [10, 11]. Then the condition (4.38) can be easily satisfied, since, 
in this case, we require only that 


2 
— m 
mx Zz N, 1/2 (27) agmp. (4.44) 


In supergravity GUTs, baryon number non-conserving interactions can also 
arise via hidden-sector effects [12]. Then X can be an observable-sector gauge- 
singlet scalar which is coupled only gravitationally to observable-sector fermions. 
In this case, 


m 
Dx — (4.45) 
mp 
and (4.38) gives 
mx S Ni’ mp (4.46) 


which is always satisfied. Thus, the general conclusion is that the decay of 
superheavy scalar particles in a supersymmetric or supergravity GUT affords the 
best opportunity for the out-of-equilibrium decays necessary for baryogenesis. 

As soon as the age H~! of the universe becomes equal to the lifetime ry of 
the X, X particles they begin to decay and generate a non-zero net baryon number. 
Using (4.37), this occurs at a temperature Tac, satisfying 


Arar, = Vx S Alramy - (4.47) 


Thus, from (4.21), 
Táec < mx. (4.48) 


Suppose that the X particle has decay channels X — f, to a final state fn 
producing baryon number B,. Then the X has decay channels X — fn producing 
baryon number — B,, and the net baryon number produced by all of these decays 
is 

AB Ty DU BT QC fa) - TC £1. (4.49) 
n 


This gives a net baryon number density arising from the decays of 

ng =nxAB — ny, AB (4.50) 
using (4.35). Thus, the baryon asymmetry (4.4) is 
n 


B ] 
~ -IgxAB . 
n, 38x (4.51) 


pz 


or 
ng || ASt(3) gx 

=z — ~ — AB. . 
"B s 21^ Ne (4.52) 
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As anticipated, AB and, hence, the baryon asymmetry, vanishes if none of the X 
decays produces a baryon number and if there is no C- or CP-violation. Further, 
if thermal equilibrium were maintained any net baryon number produced by the 
decays is cancelled by inverse decay. 

The foregoing analysis presumed that the X and X decays release no entropy, 
which is a poor approximation if Taec « my. In this case, the energy density p of 
the universe is dominated by X particles. If this is converted entirely into radiation 
at a reheating temperature Tz given by 


xz? 


3 
39 Ne T = nml (4.53) 


p> px C nxmx = 
87 


using (4.47), then 


1/4 
nx 3Tr (nir )" 


— = -— = - | — 4.54 
s Amy. A Ma N mS, 54) 
and the baryon asymmetry becomes 
3 T, 
np = -AB (4.55) 
4myx 


instead of (4.52). Either way, it seems that an encouragingly small amount of 
CP-violation is entailed to generate an asymmetry on the scale (4.18) observed. 
To see whether it is, we need to calculate AB in the various models containing 
baryon number and CP non-conservation. 


4.4 Baryogenesis in GUTs 


Grand unified theories (GUTs) seek to unify the three separate gauge groups 
SU(3), SU(2) and U(1) of the standard model in a simple group G: 


G D SU(3) x SUQ) x U(1). (4.56) 


(See [13] for a review.) The GUT hypothesis is that above some high energy 
(GUT) scale Mc, 
Mg > 10° GeV (4.57) 


G is an exact symmetry, which is spontaneously broken at the GUT scale to the 
standard model, which is itself spontaneously broken at the electroweak scale. In 
this way the (low-energy) gauge coupling strengths (a, a2, a@3) of the standard 
model are all determined from the unknown (high-energy) coupling strength ag 
of G, by using the renormalization group equations to ‘run’ between the GUT and 
the electroweak energy scales. We have discussed this in some detail in [10] but 
the essential point is that the evolution of the coupling strengths depends upon 
the matter content of the low-energy theory. Since neither ag nor mg is known a 
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priori, the GUT hypothesis can only be tested by starting from the values of the 
coupling strengths measured at the electroweak scale and running to high energies 
to see whether they converge to a single value (ag). 

The known matter content of the standard model consists of three generations 


of 
0.=(3,2,3) B-j L=0 
us =(3,1,-}) B=-} L=0 
d =8,1,}) B--l L=0 (4.58) 
LL=(1,2,-5) B-0 L=1 


e; = (1,1, 1) B-0 L=-1 


using the notation (”3,42,¥), where n3 specifies the colour SU(3) 
representation, n? the weak SU (2) representation and Y is the weak hypercharge. 
B and L are the baryon and lepton numbers (the superfix ^ indicates the charge 
conjugate particle). In addition, the electroweak Higgs 


hy = (1,2, 3) (4.59) 


is an essential ingredient of the standard model, whose discovery is currently 
awaited, hopefully at the LHC. If we assume just this matter content, besides the 
12 gauge vector bosons of the three gauge groups, it is found that the coupling 
strengths converge and reach a point of closest approach but not coincidence, at 
the energy scale and coupling strength given in (4.41). 

It is remarkable that the couplings come as close as they do and this in 
itself lends general support to the GUT hypothesis. However, the failure to 
converge precisely to a common value shows that if the GUT hypothesis is 
correct, then there must be matter additional to that of the standard model. 
Remarkably, the supersymmetric standard model, in which all of the matter 
particles (4.58) have supersymmetric (bosonic) partners (sparticles), all of the 
gauge bosons have (fermionic) supersymmetric partners (gauginos) and the Higgs 
doublet ^; has a (fermionic) higgsino partner, does produce the convergence 
sought [14]. The calculated unification scale and coupling constant are given 
in (4.42). (Supersymmetry requires an additional Higgs doublet 


hz = (1,2, -1) (4.60) 


plus its superpartner.) 

This convergence represents the best evidence we have both for the GUT 
hypothesis and for low-energy supersymmetry and it is, therefore, natural to 
wonder whether a GUT with this matter content produces baryogenesis at the 
level needed to produce the observed asymmetry (4.12) or (4.18). In a general 
GUT, the matter content (4.58) (and Higgs fields) of the standard model, or its 
supersymmetric extension, constitute partial or complete representations R of the 
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GUT gauge group G. The coupling to the gauge bosons Aa of the fermionic 
matter has the standard form 


£ — Y Ry" Gð, — gGAZt^)R (4.61) 
R 


where t^ are the matrix representations of G corresponding to the representation 
R to which the fermions belong. For the SU(5) GUT, each generation of 
(4.58) belongs to two irreducible representations 5 and 10 of the group, which 
decompose into representations of SU (3). x SU (2); x U (l)y as follows: 


5-(3,1,0 0,2, = ld. Lz) (4.62) 
10 = (3,2, $) + 8,1, -2 + (1,1, 1) = Qu u$, e$]. (4.63) 
Evidently the matrix #4 couples the gauge boson Af to fermions in 


the representations R and R, where R contains the complex conjugate 
representations, to those given in (4.58), i.e. 


Q,-(.2,-) B--4 L-0 
u = (3,1, B=} L=0 
d,=(3,1,-3) B=} L=0 (4.64) 
LL-(,21] B=0 L--1 
z -üL-) B-0 L=1. 


For baryogenesis, we are concerned with those gauge bosons which are coupled 
to fermions with a net non-zero baryon number. In the case of the SU(5) GUT 
with the gauge bosons in the adjoint 24 representation, all of the 12 gauge bosons 
additional to the 12 of the standard model have this property. They transform as 


(3, 2, —2) + 3, 2, $) (4.65) 


representations of SU(3) x SU(2) x U(1). We denote the (colour triplet) SU (2) 
doublet by (X, Y), and the SU(3) x SU(2) symmetry requires that 


mx — my = Mg. (4.66) 
The allowed decay modes 
X — dv, ul, dou (4.67) 
Y — dl, u^ u* 


are shown in figure 4.1. 
All violate baryon number conservation. In all cases the difference in the 
baryon number B and the lepton number L of the final state is 


B-L--$ (4.68) 
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Figure 4.1. Baryon-number non-conserving decays of SU(5) GUT gauge bosons. 
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Figure 4.2. Gauge vector boson vertices in the minimal SU(5) GUT. 


so we may consistently assign (X, Y) this value of B — L, which shows that B — L 
is conserved in all gauge-boson-mediated processes. B, however, is not separately 
conserved. Indeed couplings (4.67) and their complex conjugates induce the 
proton decay modes 


p met | aty (4.69) 


at a rate which, in the non-supersymmetric model, is calculated [15] to be about 
one hundred times the measured upper bound implied by (4.28). All of these 
couplings arise from the Feynman diagram vertices shown in figure 4.2. 

Higgs bosons are needed in a GUT to generate both the superheavy masses 
needed for the non-standard model gauge bosons and Higgs particles, as well as 
electroweak scale masses for the WF and Z gauge bosons. In the minimal non- 
supersymmetric SU(5) GUT, the electroweak Higgs doublet is accommodated 
in a 5 representation H, which in addition includes colour triplet scalars H3 
transforming as (3, 1, = of SU (3) x SU(2) x U(1). The Yukawa couplings 
have the form 


=J 
Ly = xunhpy! H +e xushu xi rj] Hy - hec. (4.70) 


where W, xj; are the 5, 10 representations of SU (5) which include the matter 
content (4.58) of the standard model. hy, hp are complex matrices (hu, p) fg 
acting on the (undisplayed) generation-space labels of y^ and Xu Jj. Then the 
colour triplet Higgs particles have decay modes similar to those of the X-boson 


H3 — ud, ul, dv (4.71) 


all of which violate baryon-number conservation. These are shown in figure 4.3. 
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Figure 4.3. Baryon-number non-conserving decays of SU(5) GUT colour triplet Higgs 


bosons. 


As before, in (4.68), the difference in the baryon number B and the lepton 


number L is the same for all decays: 


—_2 
B-L=- 


so B — L is conserved in all Higgs mediated processes. CP-violation arises from 
complex phases which cannot be absorbed by field redefinitions but, as we shall 
see, there is no contribution to AB at tree level. These couplings arise from the 


Feynman vertices shown in figure 4.4. 
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Figure 4.4. Higgs (5) boson vertices in the minimal SU(5) GUT. Single lines represent 5 
representations, double lines 10 representations. 


We have already noted that, in the minimal SU (5) GUT, the requirements 
(4.40), (4.44) necessary for departure from thermal equilibrium are more likely to 
be satisfied by the massive, colour-triplet Higgs scalar H3 than by the massive 
gauge bosons, so X,Y decays will not contribute significantly to the baryon 
asymmetry of the universe. Nevertheless, for completeness, we consider the 
contributions to AB defined in (4.49) from both sources. First we note that the 
tree-level contribution shown in figures 4.1 and 4.2 is zero. This is clear because 
in the Born approximation the process X — dv, for example, and X > dv 
derive from terms in the Lagrangian which are Hermitian conjugate to each other, 
so their amplitudes are complex conjugates. Since the kinematics of the two 
processes is identical, 


T(X > dv)lBom = P(X > dV)|Bom (4.73) 


and the contribution to A B given in (4.49) is zero. The same argument applies to 
the decays of Y and H3. 

At the next order, we need to include radiative corrections and look for (CP- 
violating) contributions to AB arising from the interference between the Born 
terms and these single-loop radiative corrections. For example, consider the 
radiative correction to the decay 


Hs > uflg (4.74) 


shown in figure 4.5 ( f, g are generation labels). 
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Figure 4.5. Radiative correction to H3 — ul. 


The matrix element has the form 
AM ~ (hphphu) ,, In (4.75) 


where Iy is the Feynman loop integral involved. Since the mass of the colour- 
triplet Higgs satisfies 

mg, >> my +m, (4.76) 
Ig is complex. To one-loop order, the square of the total matrix element M 
satisfies 


IMI? — | Mol? ~ 2ReLAMMII 
t t 
x 2Re[(hphphu) p.(hy) In] (4.77) 
(no summation). Mo is the amplitude for the Born approximation, shown 
in figure 4.3(b). For the corresponding antiparticle decay, we just replace all 


coupling constants by their complex conjugates and the difference between the 
rates is given by 


T(H3 — ug£g) — V (Hs > ü fg) x Im[( ph5hu) y, Gr), ,1 Im(/5). (4.78) 
Thus, when we sum over the generation labels the contributions cancel, since 
tr(h pho hyht,) = real. (4.79) 


In fact, all such one-loop interference terms are the absorptive parts of one 
or other of the two-loop diagrams shown in figure 4.6; the absorptive part is 
obtained by cutting the internal (fermion) loop and placing the cut fermions on 
the mass shell. The contribution just discussed arises from the absorptive part of 
the diagram in figure 4.6(g). It is clear that all of the other diagrams also make 
zero contribution to the baryon asymmetry [16]: the gauge couplings are real and 
the scalar couplings enter only in the combinations 


tr(hyh},) = real 
tr(h ph) = real. (4.80) 
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Figure 4.6. Two-loop diagrams giving one-loop radiative corrections to gauge boson and 
Higgs boson decays. 


The first non-zero contributions arise only from the three-loop radiative 
corrections to H3 decays, and the four-loop corrections to X,Y decays. 
For example, there are three-loop radiative corrections to H3 decay whose 
contributions to the difference between the decay widths of H3 and H3 is 
proportional to ImT, where 


T s wh huh huh i phi hp) (4.81) 
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which, in general, is non-zero [17-19]. The Yukawa couplings hy, p determine 
the fermion masses. The fermions in the three families of 10 and 5 representations 
are unitarily related to the mass eigenstates, the connection being given by 


8G 


hy = PmyQ (4.82) 
J/2mw 
hp = 8G RmpS$ (4.83) 
V2mw 
where P, Q, R, S are unitary 3 x 3 matrices and 
my = diag(m,, Mc, my) (4.84) 
mp = diag(mp, ms, m4). (4.85) 
Then 3 
2imT = —.- tr(m} [my Am% At, my Bm, B^]) (4.86) 
l6mi, 


where A and B are the unitary matrices 
A-P'R B=@QS'. (4.87) 
It is easy to see that the dominant contribution to the trace is proportional to (20] 
mjm?m. f (0) sind (4.88) 


where f(0) is a real function of the mixing angles characterizing the matrices 
A, B and 6 is a CP-violating phase. Remembering that the total decay width of 
the (colour-triplet) Higgs scalar is given by (4.43) with mf = m, the heaviest 
fermion, we conclude that the baryon asymmetry deriving from the minimal 
SU (5) GUT satisfies 


3m*m,m 
AB € (=£) PC, 10715, (4.89) 
^ Mn mS, 


However, using (4.52) and the observational data (4.18), we require that 
AB ~ 6N, x 1071? > 1077 (4.90) 


so there is no doubt that this mechanism cannot explain the measured asymmetry. 
In any case, we have already noted that this minimal theory gives an unacceptably 
high proton decay rate. 

The foregoing discussion suggests that to increase the predicted value of 
AB, we need to arrange that the asymmetry can arise via one-loop corrections to 
the Higgs decays. This entails enlarging the Higgs content. The simplest method 
is to include a second 5 of Higgs scalars H’, with a different mass or lifetime, 
whose couplings are of the same form (4.70) but with the coupling matrices hy, p 
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Figure 4.7. CP-violating, baryon-number non-conserving decays of X, Y bosons. 


replaced by hy, p. Then, besides the radiative correction to H3 decay shown in 
figure 4.5, there will be a similar diagram involving H; exchange. The difference 
between the decay rates now satisfies 


(Hs > u flg) — T (Hs > ü flp) x Im[hph huh, ]Im(g) — (491) 


instead of (4.78). In general, this is non-zero [18]. 

Unfortunately, such a model is unsatisfactory in other respects. It will have 
flavour-changing, Higgs-mediated neutral current decays [21] and, in addition, 
like the model with a single 5 of Higgs, it continues to possess the strong CP 
problem. (See section 5.3.1.) We shall see later that the most attactive solution 
to the latter problem utilizes a Peccei-Quinn U(1) symmetry [22]. This requires 
additional Higgs doublets, so that one doublet H is coupled only to dg quarks 
and the other doublet H’ only to ug quarks. This is realized in an SU (5) GUT 
by coupling one Higgs 5 to the matter (5)(10) and the other to the (10)(10) 
fields [23]. Thus, instead of (4.70), we have 


Ly = Xu hp! H^ + xunhy xiki Hye + hc. (4.92) 


This looks as though the one-loop radiative correction shown in figure 4.5 is 
now forbidden, since the exchanged Higgs has to have both D-type and U- 
type couplings. However, it is allowed, since the Higgs mass eigenstates mix 
H and H'. Nevertheless, there is no CP-violation and, hence, no contibution 
to the baryon asymmetry, for the reason (4.79) given earlier. Instead, in this 
model, the (CP-violating) baryon asymmetry arises from the decays of X, Y 
gauge bosons with a pair of Higgs bosons in the intermediate or final state [24,25] 
(see figure 4.7), and a sufficient asymmetry arises for a wide range of parameters 
(17, 26]. 

Another variant of the SU(5) minimal model is to introduce a Higgs 
multiplet belonging to a different representation, i.e. not a 5. Since, in SU (5), 


5x5-—104 15 
5x10—5-r45 
10 x 10 = 5 + 45 + 50 (4.93) 
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Higgs belonging to 10, 15, 45 or 50 representations can be coupled to the matter 
fermions. In particular, the introduction of a 45 Higgs representation also allows 
the generation of sufficient baryon asymmetry for a wide range of parameters [17]. 


4.5 Baryogenesis in SO(10) GUTs 


We have discussed the SU (5) GUT at some length, despite its inadequacies. 
There are, however, other GUTs, based on larger (higher-rank) groups, which 
some consider to be more attractive, and which, in any case, possess features 
which are not present in the SU (5) GUTs. In particular, as a vector-like theory, 
SO(10) has the desirable feature of being automatically anomaly-free. Another 
attraction is the fact that it includes all of the fermions of one generation within 
a single irreducible 16-dimensional (spinor) representation of the group. In terms 
of the representations of SU (5) x U(1) C SO(IO), it decomposes as 


16 = (1, —5) + (5, 3) + (10, —1) (4.94) 


with the second number specifying the U(1) charge of the SU(5) irreducible 
representation. The 5 and 10 irreducible representations are precisely those to 
which the matter fermions are assigned in SU (5), shown in (4.62),(4.63), and the 
extra SU (5) singlet state, which must exist if SO(10) is correct, could be a right- 
chiral neutrino vg ~ vj: this latter possibility looks feasible as well as attractive 
in the light of evidence [27} for neutrino oscillations which can, of course, only 
arise if neutrinos have (different) masses and, therefore, possibly right-chiral 
components. (The existence of such a state also allows for the possibility of 
a Majorana mass term.) The inclusion of both of the chiral components of all 
fermions within the same GUT representation means that charge conjugation (C) 
and parity reversal (P) are naturally symmetries of the GUT and are spontaneously 
broken when the GUT breaks to a gauge group which does not have this property. 
This occurs at a high scale if, as it might, SO(10) breaks to SU(5) or to 
SU (5) x U(1). This happens if, for instance, the GUT symmetry-breaking Higgs 
are also in a 16 representation. 
However, there are other possibilities. Clearly 


SO(10) 5 SO(6) x SO(4) (4.95) 
so, since 


SO(6) = SU(4) 
SO(4) = SU(2) x SU(2) (4.96) 


one possibility is that the $O(10) breaks at the GUT scale as 


SO(10) € su(4) x SUQ), x SUOR = Gan. (4.97) 
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This happens if, for instance, the GUT symmetry-breaking Higgs transforms as a 
54-dimensional representation. Then the 16-dimensional fermion representation 
is given by 


16 = (4, 2, 1) + (4, 1, 2) 
_ uj Ve d; e 
ECOJ CS]. e» 


for the first generation. At later stages, the SU (4) breaks at a scale M, and the 
SU (2) a breaks at a scale Mg as 


su(4) 5 SUY, x UOY (4.99) 
and 
Mg 
SUQ)a Z U(Dg. (4.100) 
So either M 
c M M 
Gan  G3122 > G3121 > Gsm (4.101) 
or 
M M. M 
Gan — Gar, 3 Gaii > Gsm (4.102) 
where 
G3122 = SU (3, x U(1Y x SUQ), x SU(2)R (4.103) 
G3121 = SUG)e x U(1Y x SUD), x U(1)r3 (4.104) 
Gazı = SU(4) x SUQ), x UD (4.105) 


and Gsm = SU (3)e x SU(2), x U(Dvy is the standard model gauge group. 
The content of the 4 representation of SU (4) under the decomposition (4.99) 
is 
4-3,D40,-0 (4.106) 
since U (1)' is a (traceless) generator of SU (4). We have used a normalization of 
the hypercharge Y’ which shows that, for fermions only, 


y’=B-L. (4.107) 


This was first noted by Pati and Salam [28] and, for this reason U (1)' is sometimes 
denoted U(1)s—z. At any rate, unlike the SU(5) GUTs, it is possible to break 
B — L conservation in SO(10) models. As previously noted, the scale M; at 
which U(1) and, therefore, B — L conservation is broken is not necessarily the 
scale at which SO(10) is broken. 

In fact, the SO(10) group has an element D which interchanges the charge 
conjugate doublets within the 16 representation (4.98). To see this, we choose 
the decomposition (4.95) so that the Cartan subalgebra of S O (6) is generated by 
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the SO(10) generators M!2, M34, M56 and that of SO(4) by M78, M910. The 
Cartan subalgebras of SU(2)z,r are then generated by Tj p = 4(M78 + M910), 
We first note that 

D = MP MS (4.108) 


where 
(M*5), = 878? — 848? (4.109) 


is an element of SO (10). The action of D on the underlying 10-dimensional space 
is to reflect the coordinates x^ «> —x^ (a = 2,3,6,7) leaving the remainder 
invariant. Thus, the effect of D on the Cartan subalgebra of SU (4) = SO(6) is 
to reverse the signs of the generators 


D; 73815 _, 5805 (4.110) 
while on that of SU (2); x SU(2)a 
D:Tj e Tg. (4.111) 


Thus, - 
D : (4,2, 1) e (4,1, 2) (4.112) 


as asserted. 

Because of this, unlike the SU (5) GUTs, a baryon asymmetry is often not 
generated at the GUT-breaking scale but rather at the lower scale M; at which 
U (1y is broken. Whether or not this happens, of course, depends upon the Higgs 
fields responsible for this symmetry breaking and their coupling to the matter 
fields. The Higgs fields that couple to matter must be in one or more of the 
SO(10) representations occurring in the product 


16 x 16 = 10, + 120, + 126,. (4.113) 


For example, with the colour-triplet Higgs particles belonging to the 10- 
dimensional representation, the symmetry (4.112) ensures that 


r(H > £0) = T (Hi > £Q) (4.114) 


at one-loop level, so no baryon asymmetry results, similarly in the Q Q channels 
[29]. However, if the Higgs content is enlarged to include a 45-dimensional, a 
126-dimensional and an additional 10-dimensional representations, then with a 
suitable choice of (45) we can break 


$0(10) > G1 (4.115) 


in a way which breaks the D-symmetry (G3122 is defined in (4.103)). This is 
done by ensuring that the non-zero VEVs are odd under D. In addition, the 
extra Higgs content splits gz and gr, the coupling constants of the SU (2)L, R 
groups, and ensures that the colour-triplet Higgs mass eigenstates are complex 
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superpositions of those coming from the two 10s. This is sufficient to generate a 
baryon asymmetry at the GUT symmetry-breaking scale [29]. 

The lesson to be learnt from these considerations is that a baryon asymmetry 
can arise in such theories but that the energy scale at which it arises may be 
much lower than the GUT scale. The magnitude of any such asymmetry depends 
sensitively on the details of the particular model but, in many models, there is 
ample room in parameter space to accommodate the observed asymmetry. 


4.6 Status of GUT baryogenesis 


The discussion in the two previous sections of baryogenesis using the baryon- 
number non-conserving interactions of a GUT was based upon the assumption 
that the superheavy GUT gauge bosons or Higgs particles whose decays produce 
the desired baryon asymmetry are, in the first place, in thermal equilibrium 
and then, as the universe expanded and the temperature dropped, came out of 
equilibrium when the condition (4.37) was satisfied. It is at least debatable 
whether this assumption is well founded. 

We shall see in chapter 7 that there are strong theoretical reasons, and some 
support from observational data, for believing that the universe went through a 
period of ‘inflation’, during which the scale factor grew by a factor of order 
1077, so that the observable universe evolved from a single Hubble volume. It 
is, therefore, essential that the baryon asymmetry we observe was generated after 
inflation: any asymmetry generated earlier would be so diluted by the inflation 
as to render it utterly unobservable at the present time. This is the source of the 
difficulty with the scenario envisaged hitherto. To generate the required amount 
of inflation requires the inflaton potential to be rather flat: (the ‘inflaton’ (4) is the 
presumed field whose evolution determines how much inflation actually occurs). 
This means that the mass of the inflaton is relatively low [30], in the range 


mg € 10'3-10 GeV (4.116) 


to account for the observed flatness and homogeneity of the universe and to solve 
the horizon problem!. When it reaches the minimum of its potential, the inflaton 
oscillates about its value at the minimum. As it does so, somehow, this low- 
entropy, cold universe evolves into a hot universe dominated by radiation. The 
key question is: What is the temperature of this ‘reheated’ universe? This is a 
vital question because the bound (4.116) means that, in some cases, 


mg < 2mx (4.117) 


where myx is the mass of the particle whose decays generate the baryon 
asymmetry. This is the case in the minimal SU(5) GUT, for example, where 


l This is because m2 ~ V^ (9), and the double derivative is constrained by the condition (7.45) with 
V (4) satisfying (7.117). 
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X = Hy is the colour-triplet Higgs particle with 
mu, 2 10" GeV. (4.118) 
In these circumstances, the decay 
$— XX (4.119) 


is kinematically forbidden, so if X particles are created, they must be created 
by thermal production in the reheated universe. (This is why our comments in 
section 4.5 about the possibility in other GUTs of baryon asymmetry arising at 
a scale well below Mg are pertinent.) So the next question is: What is the 
abundance of the out-of-equilibrium X particles thus created? It is beyond our 
scope to discuss here the calculation of the reheating temperature, the abundance, 
and other questions which arise, in any detail. The interested reader is referred 
to [31] and references therein. We shall content ourselves with noting the 
conclusions which have been reached from these studies. A mechanism called 
‘parametric resonance’, deriving from nonlinear quantum effects, leads to a 
phenomenon called ‘preheating’ during which copious production of X particles 
occurs even though these are heavier than the inflaton. It appears that the out- 
of-equilibrium scenario, upon which our analysis was predicated, arises naturally 
and that the right amount of baryon asymmetry is produced for a very wide range 
of decay widths of the X particles. 

Notwithstanding this highly welcome outcome, one is naturally led to 
wonder whether the observed baryon asymmetry might not have a different origin. 
Although, as we have noted, there is evidence of (supersymmetric) unification of 
coupling strengths, this does not necessarily entail the existence of a GUT. It is 
quite conceivable, in the context of string theory for example, that there is no 
GUT. If so, the observed baryon asymmetry must have a different origin. This is 
the topic to which we now turn. 


4.7 Baryon-number non-conservation in the Standard Model 


It is easy to see that the standard model Lagrangian, having the local gauge 
symmetry group SU (3), x SU (2); x U(1)y, is also invariant under the (classical) 
global U (1) transformations associated with the baryon number (B) and lepton 
numbers (Ne, £ = e, u, r) in which fermion fields y(x) transform as 


v(x) > 8? (x) (4.120) 
v(x) > NF y (x). (4.121) 


When 9 is local, the first of these (4.120), for example, applied to the kinetic term 
produces a change ôS in the action 


ôS = — f d'x (y^ By y/)8,0 
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= f dx 08, (V y^ By y) (4.122) 


where By is the baryon number of y. Since the remainder of the Lagrangian 
density is invariant under the (global and therefore the local) transformation, the 
(Noether) current 


jP = 2 Vy,By V (4.123) 


associated with baryon number, is divegenceles 
8^ je =0 (4.124) 


and B is conserved at every order of perturbation theory. Similarly, for the lepton 
currents, 


jl? =} byl (4.125) 
Y 


However, classical symmetries such as these are generally not preserved at the 
quantum level [32-34]. In particular, in a chiral theory, in which the left- and 
right-chiral fermion states are coupled differently, as occurs in electroweak theory, 
there are chiral anomalies resulting from the non-invariance of the field measure 
Dy Dy in the functional integral determining the generating function [34]. This 
non-invariance is equivalent to a further change ôS’ in the action 


1 - 
8s = isi f d x o(x)[uf{ FOS, Fe") By — e((FOD, FBI] (4.126) 


where Fry (Da = Fi (D is the gauge field strength coupled to the left-chiral 
component "m jid — ys)¥ of yr; t? is the generally matrix-valued coupling 
of the representation of the gauge group to which the left-chiral component wz 
of the fermion belongs, FD = 5€yvpo F??7( is the dual field strength and {, } 
denotes anticommutator. The trace is over all fermions y. Similarly for FCD, 
Thus, combining this with (4.122), we see that quantum effects require that 


;(B) — 


] - 
9^ j(D = —— F, FOO) B — (OD, pr]. (4.127) 


321 30432 uv? 


For the standard model, we note that there is no contribution to the divergence 
from the SU (3), group, because, for each quark flavour, 


FO) = FA. (4.128) 


The left-chiral quark states Qz of the first generation are coupled both to the 
SU (2); field strength Wiv (a = 1,2, 3) with t? = gii (1? are the 2 x 2 Pauli 


matrices) and to the U (1)y field strength B,, with strength giYz = g1 l. The 
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right-chiral quark fields ur, dg are coupled only to Bj, with strength giYg = 
£1 2, £ E. respectively. Thus the baryon number current 


jp =}} ana (4.129) 
q 
satisfies N. 
: G ~ ~ 
9^ i = S2 2 Wus WY” — gi Buv B] (4.130) 


where NG = 3 is the number of fermion generations. (It is important to remember 
that each quark flavour q occurs in three colours.) The meaning of this important 
equation is that the quantum average, the expectation value, of the divergence is 
equal to the expression on the right-hand side in a fixed background field given 
by the gauge fields W; (x) and B, (x) [35]. 

In the same way, it is easily seen that the lepton number currents 


iL? 2 tyuve +Eyat (=e, n t) (4.131) 
satisfy 
1 
(Lo) — (B) 
9" jL0 = Ne 9^ jc (4.132) 


and it follows that (1/NG) B — Le is conserved for each £. Also, defining the total 
lepton number 


Lz yu (4.133) 
t 


we see that B — L is conserved, even though neither B nor L is. 
Both of the field-strength factors on the right-hand side of (4.130) are 
expressible as total divergences: 


B,, B"" = 9,k" (4.134) 
where 
k^ = e""?? B, B, (4.135) 
and . 
wa, wa? = 3K" (4.136) 
where 


K" = e" [W^ We + 1g,e* We WW] 
= 2e"99 tr WipWo — i$ 92W,W5 Wa]. (4.137) 


Note that, although the divergences à, k^ and 9, K^ are gauge invariant, the 
individual currents are not. In the first instance, let us consider the (classical) 
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background vacuum fields in the SU (2); x U(1)y theory. The standard solution 
is to take all fields to be zero except for the Higgs doublet for which 


gre = =; ( 0 ) zo (4.138) 


where v is constant. Both K^ and K^ are then zero, of course. Because of the 
SU (2),, x U(1)y gauge invariance of the Lagrangian, we could as well choose a 
gauge transformation of this solution. Then, 


$"* (x) = U(x)U1()9o (4.139) 


where U| (x) is a general element of U (1)y and U(x) is likewise a general element 
of SU (2),, so we may write 


U (x) — a(x)I tir - b(x) (4.140) 

with 
det U(x) = a(x}? + b(x) - b(x) = 1. (4.141) 
Thus, U (x) can be regarded as a mapping from spacetime into the three-sphere 


(S3) which is the group space of SU (2). In this gauge, the U (1) vector potential 
is non-zero: 


By = Z UU; (4.142) 


but, of course, the (gauge-invariant) field strength B//7 is still zero and k” remains 
Zero: 
ke =0. (4.143) 


Similarly, in this gauge, the SU (2) gauge field becomes 


i a]l 
wx = 2; ROU = at -Wa (4.144) 
and, with the parametrization (4.140), this gives 
2 
w = -yz Crb — bð a + 94b x b) (4.145) 


For future reference, we note that, using (4.141), 


4 
Wit Wit = zala) Oa) + (Bub) - GB) 
2 


4 
= Yu» (4.146) 
82 


where y, is the metric on S? for the spacetime coordinates. The vacuum state 
described by (4.144) may be taken to be time-independent and such that U > h 
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as |x| — oo. Then U effectively maps the S? of real space, obtained by 
identifying points at infinity, into the S? which is the SU(2) group space [36]. 
Such mappings are characterized by their homotopy class. Since 


m$?) = Z, (4.147) 


where Z is the group of integers, any such mapping is associated with an integer 
which counts the number of times the spacetime 5? is wrapped around the (unit) 
S? of the internal SU (2) space. 

Now, the field strength W^ constructed from W"* is zero but the current 
K" (4.137) is no longer zero: 


4i 
Kuve — eue trf wy Wy Wo (4.148) 
So 
K Ovec = R eh gabe (ovis pe cv) 
= —2godet W*, (4.149) 
Also 
det W"* = ydet G (4.150) 
where 4 
82 


and y;j is the metric (4.146) on S? for the spatial coordinates. Thus, if we define 
K= f dx Kove (4.152) 


we have that 


= — ——n (4.153) 


where n is an integer which counts the number of wrappings of the internal $? 
provided by the mapping U; the volume of the unit S? is 272. For a general 
(non-vacuum) field configuration, 


K = —z Nes (4.154) 
85 
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where Ncs is the Chern-Simons number. It derives from the existence of a 3- 
form Ka) which arises because the 4-form tr(W2) A Way) is exact; Wo) = 
Wy y dx” A dx" is the field strength 2-form. Thus, in our (vacuum) case, Ncs 
is just (minus) the winding number n. 

Evidently electroweak theory, unlike QED, has an infinity of topologically 
distinct vacua, which may be labelled with their Chern-Simons number. These 
vacua are not physically distinct [36]: they are gauge transformations of each 
other, as we have emphasized. Now consider the total baryon number 


B= f gi, jo» (4.155) 


and let the gauge and Higgs fields be general and have time dependence. Then 
the baryon number B will also be time-dependent. In a time interval (ti, tf), the 
change is 


tf 
AB =f dt 9B = fes 9, j^ (4.156) 
ti 


assuming that j?) vanishes at spatial infinity. Note that using (4.130) this is 
gauge invariant. Using (4.134),(4.136) and (4.143), we see that 


N, 2 
AB= - 3 ferou" 
E Nos 
322? 


Now suppose that in this time interval the gauge and Higgs fields traverse a 
non-contractible loop in the field configuration space, starting and finishing in a 
vacuum configuration. Then the change AK is given just by the vacuum formula 
(4.153), which gives 


AK. (4.157) 


AB = —NgG ^n = NG ANCcs. (4.158) 


In other words, in the standard model, baryon-number non-conservation arises 
in integer multiples of Ng whenever the initial and final vacuum states are 
topologically distinct. 

It is instructive to consider what is happening using Dirac's picture of the 
vacuum as the state with all negative-energy fermion levels filled [37]. We start 
(at tj) and finish (at tf) in this state. In the presence of non-zero field strengths 
and Higgs fields, we expect the energy levels to be displaced at intermediate 
times. In the (trivial) case of QED, for example, although the levels are perturbed 
from their values at f;, they each return to their original level at ty, because the 
electromagnetic field interacts with the left- and right-chiral fermion components 
with equal strength. However, the baryogenesis with which we are concerned 
derives from the chiral nature of electroweak theory: only left-chiral fermions 
interact with the SU (2) gauge bosons. Depending on the field configurations, this 
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ti t 


Figure 4.8. Fermion energy-level crossing in electroweak theory. 


chiral property allows the energy levels of all of the (infinite number of) negative 
energy states to be raised by one (or more) level such that at t y all of the negative 
energy levels remain occupied. In the process, one (or more) of the positive energy 
levels is occupied and we see one (or more) fermions produced. (See figure 4.8.) 
Of course, if just one fermion is produced, angular momentum is not 
conserved, and, in fact, electroweak theory with just one doublet is inconsistent: 
it has a chiral anomaly. In the realistic case, each generation has four doublets: 
three (colours of) quark doublet and one lepton doublet. Thus, with ng = 3, 
there are 12 doublets in all and with the minimum of just one level crossing, 12 
fermions are created, nine quarks and three leptons. An allowed process, which 
has total charge zero, so charge is conserved but has baryon number 3 and each 
lepton number 1, so that Ng — iB is conserved might create from the vacuum 


uudeuddv,uddv, (4.159) 


or, equivalently, 
pn — net VV. (4.160) 


4.8 Sphaleron-induced baryogenesis 


At intermediate times between t; and ty, there are non-vacuum field 
configurations, which necessarily have higher energy associated with them. The 
situation is, therefore, analogous to a particle moving in a one-dimensional 
periodic potential V (x), in which a potential barrier separates adjacent minima. 1f 
the particle has energy E less than the barrier height, classically it remains trapped 
in one of the valleys of the potential, oscillating between the turning points where 
the total energy E — V(x). However, quantum mechanically there is a non-zero 
probability of penetrating the barrier. In the semi-classical approximation, the 
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amplitude for transmission is given by the WKB formula in which the amplitude 
is suppressed by an exponential factor 


T =exp( - [ par) (4.161) 
Xo 


where p(x) = 4/2m[V (x) — E] and the limits of integration xo and x; are the 
two points at which the kinetic energy of the classical particle is zero, so that 
V(xo) = E = V(xi). It is convenient to choose the zero of energy such that 
E = 0. Suppose that xo is a minimum of V (x), so that we are considering the 
probability of a particle in classical equilibrium tunnelling through the potential 
barrier. In several dimensions, (minus) the exponent in (4.161) generalizes [38] 


to 
x 
b =f ds 2m VG) (4.162) 
Xi 


0 


(where ds? = dx - dx) and the integral is to be evaluated along the path for which 
b is a minimum. The required path x(t), therefore, satisfies 


d^?x 
—_=VV 4.163 
de ( ) 
with l dr dx 
—m—-+-— — = 0. 4.164 
2" dt dr Ve) ( ) 


Equation (4.163) is just the Euler-Lagrange equation for the imaginary-(or 
Euclidean-)time version of Hamilton's principle, in which the formal substitution 
r — it is made. In other words, it minimizes the Euclidean action 


SE = fo Le (4.165) 

where | dr dx 
= -m— - — + V(x). 4.166 
LE z" dr + V(x) ( ) 


Equivalently, it describes the motion of a particle in time r moving in the inverted 
potential — V (x). It is clear then that the classical equilibrium point xo can only 
be reached asymptotically, as r — —oo 


lim x — xo. (4.167) 
—oo 


to 


We can choose the time at which the particle reaches x1, where dx/dr = O next, 
to be r = 0. Then the exponent b may be written as 


x 0 
b= | “ds mV) = | dt Le. (4.168) 
Xo —00 
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For positive t, the motion is just the reverse of the motion for negative t, so that 


oo 
b= if dt Lg = $SE. (4.169) 

-00 
Thus, the quantum-mechanical transition rate T' at which the particle tunnels 
through the potential barrier is exponentially suppressed as [39, 40] 


Tae Se, (4.170) 


In a quantum field theory, to find the transition rate between adjacent vacua 
we need a field configuration which interpolates between them in Euclidean 
spacetime, as in section 2.9. We can get a feel for what is involved using an 
observation made by Belavin et al [41] and ’t Hooft [42]. For a Euclidean 
spacetime, 


f dix (W8, — Wa)? > 0. (4.171) 


So, ignoring the Higgs contribution, the action is 
Sc=} f díx wa wa -—1 J dx Wz, We, 
>} f dêr wa We, 


d*x 3p K" 


ANcs (4.172) 


1 

à 

8x? 
83 


where we have used (4.136) and (4.154). So a gauge field configuration which 
interpolates between vacua with ANcs = 1 will have Euclidean action 


2 
Sg(1) > Z. (4.173) 
a2 


Already we can see that the tunnelling probability is likely to be incredibly small, 


since 
exp[—Se(1)] ~ 10-90 (4.174) 


using 
ay! c og sin? 0y ~ 30 (4.175) 


as suggested by current data. Of course, (at zero temperature) in a pure Yang- 
Mills theory, such as this, nothing sets the overall scale, so we may not yet write 
down the tunnelling rate per unit volume. To do this, we need to consider the 
spontaneously broken theory. 

A related problem is to determine the energy scale of the potential barrier 
separating adjacent minima. The schematic diagram in figure 4.9 suggests that 
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Figure 4.9. Schematic picture of the dependence of the static energy of the gauge-Higgs 
system upon the field configuration [A, $]. The minima are topologically distinct vacua, 
labelled by the integer-valued Chern-Simons numbers. The maxima, on this slice, are 
unstable sphaleron configurations. 


the barrier height will be the energy associated with a field configuration having 
Chern-Simons number Ncs — 5. (Remember that Ncs is only integral in a 
vacuum field configuration. Manton [36] and Klinkhamer and Manton [43] 
constructed just such an object. They assumed a static, spherically symmetric 
ansatz in which the Higgs field 


ó(r) = h(mwr)U (r)jo (4.176) 
with 0 

o2 4.177 

#-3(0) em 


as before, mw = lev and 
W; = z f men uu" (4.178) 
2 
where U has the form (4.140) 


l 
U= zeh +iyti +ixt2] (4.179) 


and » 
r - dri — Q2 +y 2). (4.180) 


h and f are functions to be determined. The energy of the field configuration is 
given by 


E= fè [4 tr(Wi; Wij) + 1Bi; Bij + (Did) (Did) + V(O (4.181) 
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where Wav, Buy are the usual field strengths, Dj? is the gauge covariant 


derivative 
Did = dip + igaWid + igi Bid (4.182) 


and 
VeA) =a(olg — i). (4.183) 


In the first instance, the U(1) fields are ignored, i.e. Ow = 0. Substituting the 
ansatz gives the energy in the form 


eo 
= mw ("ar (e PP AMS me) (4.184) 
a2 Jo mw 

where £ = mwr, so that f and h are functions of & alone, and m2, = 2Av? is the 
Higgs mass. The Euler-Lagrange equations which follow from requiring E to be 
minimized may be solved approximately or numerically, with f and h required 
to approach zero as E — oo. Substituting the solutions back into E gives the 
minimum energy in the form 


win = We (24). (4.185) 


The scale is set by the prefactor 


2 
T ~5TeV (4.186) 
a2 
and the function € is rather slowly varying: it increases only from 1.5 to 2.7 as 
my increases from zero to infinity, taking the value 2.1 when my = 2.8my. In 
the range 25 GeV < my < 250 GeV, £ is well approximated by 


E(x) = 1.58 + 0.32x — 0.05x?. (4.187) 


Allowing Ow # 0 changes Emin by about 100 GeV. 

Although the configuration has the minimum energy among those satisfying 
the ansatz, it is nevertheless unstable against perturbations which do not satisfy 
the ansatz. This was to be expected too from the schematic diagram in figure 4.9, 
in which the top of the potential barrier is evidently a maximum. Because of this 
instability, the configuration was named a ‘sphaleron’, from the Greek meaning 
‘ready to fall’. The energy of the sphaleron satisfies 


8 TeV < Espn < 14 TeV (4.188) 


and measures the height of the saddle point in configuration space over which 
the vacuum must be ‘pushed’ to reach a topologically distinct vacuum. This 
suggests that the exponentially small tunnelling rate (4.174) might be evaded by 
supplying O(10 TeV) of energy, for example in a pn collision. Then some baryon 
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number non-conserving transition, such as (4.160), might be observed. Such 
energies will soon be available at the LHC. However, it is not just the potential 
barrier which has to be surmounted. The energy scale (4.188) suggests that about 
30 weak gauge bosons must be created and assembled into the highly coherent 
sphaleron configuration, so that the process is highly suppressed by phase-space 
considerations unless the energy is considerably in excess of the sphaleron mass. 

The other possibility, and the one which is of primary concern to us, is that, 
at temperatures kgT ^ Espn, thermal effects might give baryon number non- 
conservation at an appreciable rate. To calculate the thermal transition rate we 
must remember that the effective potential becomes temperature dependent [44] 
and, in consequence, so does the the Higgs field expectation value v(7) and, 
hence, the W-boson mass m w (T). In fact, using (2.55), since A in (2.93) is l that 
in (2.42), 


1/2 
my(T)= 580(T) =myp A + ( — z) (4.189) 
where my 
Im| = Av = Vi (4.190) 
at tree level, 
C= 3e3(1 + 2cos?0y) (4.191) 
4x sin?20y 
in electroweak theory and, from (2.54), 
-12 
T) = To ( T s) (4.192) 
with Tọ given by (2.95) 
-1/2 
To = Im| E + ae + d . (4.193) 
The measured values of the parameters give 
133 GeV € To < 360 GeV (4.194) 
with the lower bound deriving from the current lower bound [7] 
my Z 114.3 GeV at 95% CL (4.195) 


on the Higgs mass. Then 


TZD <23 x107 (4.196) 
To 
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with the upper bound saturated when my achieves its lower bound. The 
temperature-dependent sphaleron energy satisfies 


2mw(T 
Eqpn(T) = De (4.197) 
e» 
with 
19<€ <2.7 (4.198) 


and the associated Boltzmann factor exp[— Espn(T)/ T] means that the baryon- 
number non-conservation is unsuppressed when 


T > Es (T). (4.199) 


However, this is satisfied only when T is practically at the critical temperature at 
which the energy barrier disappears in any case. Careful analytical estimates of 
the baryon-number changing rates, valid in the range 
2mw(T 
2mw(T) « T « emet (4.200) 
2 
have been made by Arnold and McLerran [45]. They find a transition rate per unit 
volume 


r 3 
sph _ «my ry ( ZD) e^ Ema (T)/T (4.201) 
V aT 


where the numerical factor x ~ 11, although a (non-perturbative) numerical 
‘measurement’ of the diffusion rate of the Chern—Simons number over the barrier 
[46] suggests that this may be too large by a factor of order 10. 

At high temperatures where T > Ti, the Higgs field VEV v(T) is zero 
and there is no sphaleron. The gauge fields, however, can still generate baryon- 
number non-conservation. The energy Ep of such a configuration is controlled by 
the a priori length scale £ of the configuration which changes the Chern-Simons 
number Ncs. Presumably the scale of the action is set by (4.173), so 

2n 


Ept ~ —. (4.202) 
a2 


To avoid Boltzmann suppression, we require 


2x 
E~ eT (4.203) 
al 
so 2 
e>—. (4.204) 
aT 


Thus, the transition rate per unit volume is 


T, 1 aT M 
v ~ Bir ~ p (4.205) 
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if we assume £ ^ t. However, the scale (4.204) is also the scale at which 
perturbation theory breaks down in a hot plasma and it has been argued [47] that 
plasma damping effects increase the time scale, so that 


t 
t~ — (4.206) 
a» 
basically because there are fewer ways large field configurations can cross the 
barrier than smaller ones. This means that 


4 


and indeed a lattice simulation [48] gives 
Tr 
voc 29 + 6)a3T*. (4.208) 


Numerically there is no difference between the two expressions. 

At any rate, it is clear that there is no Boltzmann suppression at high 
temperatures, so we do have a source of baryon-number non-conservation. 
Whether or not it can explain the observed baryon asymmetry of the universe, 
of course, depends upon the other two Sakharov criteria: the amount of CP- 
violation and whether the system is out of thermal equlibrium. The picture we 
have in mind is that, in some region of space, there is a non-trivial gauge and 
Higgs field configuration, of the type we have discussed, which leads to the non- 
conservation of baryon number. To generate a baryon asymmetry we require CP- 
violating interactions involving the quark fields. Then the transition rate for the 
sphaleron-induced process with ANcs = +1 will differ (slightly) from that with 
ANcs = — 1. Provided that the system is not in thermal equilibrium, there is then 
the possibility of a net non-zero baryon-number asymmetry. 


4.9 CP-violation in electroweak theory 


In electroweak theory the sole source of CP-violation is via the Cabibbo- 
Kobayashi-Maskawa (CKM) matrix, which derives a CP-violating phase from the 
unremovable phases in the Yukawa interactions of the quarks when we transform 
to the mass eigenstates. The form of these interactions is 


Ly = ó'dghp Qu + V'üghy QL - h.c. (4.209) 


where Q; is the quark doublet and ug, dg the singlets, $ is the Higgs scalar 
doublet and y = it2$*. As in equation (4.70), hy and Ap are complex matrices 
acting on the undisplayed generation indices of Q;, ug, dg. Also, as in equation 
(4.78), we need to construct a diagram with non-vanishing imaginary parts in both 
the loop integral and the trace over generation indices. It is easy to see that (4.81) 
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does not derive from a diagram allowed by the vertices in (4.209). In fact, the first 
contribution arises at the 72th order in perturbation theory, from the trace 


Ti = tr(hl huhi huh hp phi huh} hp) ^ h.c. (4.210) 


Again we may diagonalize hy and hp as in (4.82), (4.83), (4.84) and (4.85). 
Then [49, 50], 


12 
Ty = 2 tr(mul*V impt V my V mpi v*) (4.211) 
12 
64my 


where 
y 2 gs (4.212) 
is the CKM matrix and 


Imu|* = diag(m?, m2, m$) (4.213) 


and similarly for Impl‘, |my |? and \mp|?. Then 


12 
8 
Im Tn = — 3 3^ (mi); (md) mmp) IMV: Vu VV). (4.214) 
W ijk 


Now 
Im[Vij Vir Vj V] = J > eikp€jtq (4.215) 
Pq 
where J is the Jarlskog invariant [49]. In the standard CKM and Particle Data 
Group parametrizations respectively, it is given by 


J= sin26, sin 05 sin 63 cos 0j cos 0» cos 63 sind 


= sin012 sin 613 sin 623 cos 612 cos02, cos 23Sin 313. ^ — (4.216) 
Then 
gy 2 2 2.2 
Im Ti? = casia 7 me — m2)(m? — m,)(m; — m2) (m2 - m2) 
[7 
x m2 — m2)(m2 — m2) 
g? 44,22 
~ Jm,m,m-zm.;. (4.217) 
12 pb es 
64m y 


So we expect the scale of CP-violation in the standard model to be controlled by 


the parameter 
Sem = A (25) j| 5 |. (4.218) 
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With A ~ 10° perhaps to allow for the large numberof Feynman diagrams, we 
still have 

3sm € 107” (4.219) 
using the current data [7]. This tiny number scales the difference in free energies 
and, hence, the difference between the rates of the ANcs = +1 processes. 
By itself, it is sufficient to exclude the possibility of explaining the observed 
baryon number asymmetry of the universe in the context of the standard model 
electroweak theory, so extra sources of CP-violation are clearly needed. Such 
sources arise naturally in the supersymmetric version of the theory which we 
shall discuss shortly. Before doing so, however, we shall see that there are further 
reasons why the standard model electroweak theory cannot yield the measured 
asymmetry. 


4.10 Phase transitions and electroweak baryogenesis 


We have seen that electroweak theory possesses (sphaleron-induced) baryon- 
number non-conserving processes, as well as CP-violation via the CKM matrix. 
However, there is a rather general argument that these cannot generate the baryon 
asymmetry of the universe in the absence of a phase transition or if the phase 
transition is second order [51]. 

Suppose the latter and consider the universe at temperature 7 satisfying 


mw <T <T; (4.220) 
with T; the critical temperature of the phase transition. For a second-order phase 
transition 

r? 1/2 
mw(T)=mw IJ . (4.221) 


We can see this from (4.189) by setting C to zero. Then v(T) and, hence, my(T), 
approach zero continuously as T approaches T; from below; also, when C = 0, 
To = T = Te. As in the case of GUT baryogenesis (4.37), we now require 
that the rate Fsph of baryon-number non-conserving processes is smaller than the 
(Hubble) rate H (T) associated with the expansion of the universe, so that the 
baryons are decoupled from the thermal bath. The total sphaleron rate F'sph is 
obtained by scaling the rate per unit volume l'spp/ V, given in (4.201), with R? (1), 
where R(t) is the scale factor, proportional to T ^! in the radiation-dominated era. 


Thus, roughly, 
Vspn ~ KT exp |- (4.222) 


where Esph(T) is given in (4.197). With H (T) given by (4.21), decoupling only 
happens when the temperature T drops below T* with 


T* ` Te a mp 2? 2 


2 
T 
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This gives 


* 


0.71 < = < 0.88 (4.224) 


c 

corresponding to the bounds (4.194). Thus the sphaleron-induced baryon-number 
non-conserving processes with which we are concerned are not decoupled until 
the temperature is well below 7; but still well above my [51] and, at these 
temperatures, the exponential suppression essentially turns off the baryon-number 
production. At the higher temperatures where the Boltzmann suppression is 
evaded, the rate exceeds the universal expansion rate and the resulting thermal 
equilibrium washes out any baryon asymmetry. So the picture we have is as 
follows: As the temperature drops below To, the symmetry breaks and the field 
$ develops a non-zero VEV at the minimum of the potential, as in figure 2.1. 
At each point in space thermal fluctuations perturb the field @ which then ‘rolls’ 
classically to the new value at the global minimum. 1f this phase transition is 
second order or a continuous crossover, the slow rolling to the new minimum 
means that the departure from thermal equilibrium is too small until 7 drops 
below T* but by then the sphaleron-induced baryon production has been turned 
off by the Boltzmann exponential suppression. 

Thus, the only possibility is that the phase transition is first order. In this 
case, as discussed in section 2.9, for temperatures T > Tı, the only minimum 
of Veg is with the system in the symmetric, unbroken phase characterized by 
zero VEV. As the temperature falls below 7), a local minimum of Veg develops, 
separated by a potential barrier from that at zero VEV, and below the critical 
temperature (T < Te) this broken phase becomes the global minimum, see 
figure 2.2. Nucleation of the broken phase proceeds by the formation of bubbles 
of this true vacuum in the sea of false (symmetric phase) vacuum. At some 
supercooled temperature below 7;, the size of the bubbles becomes large enough 
for them to overcome the surface tension effects and they expand to fill the whole 
of space and complete the phase transition. Finally, for temperatures below To, 
the local minimum at zero VEV disappears and only the broken phase is stable. 

A baryon asymmetry may be generated as the wall of an expanding bubble 
passes through a region containing particles in the unbroken phase. The Higgs 
field changes rapidly because of the wall motion, as do other fields, and these 
interact with the particles giving concentrations quite far from equilibrium. If the 
baryon-number non-conserving processes and the CP-violating processes both 
occur in or near the wall, a net non-zero baryon asymmetry can result: this 
scenario is called local baryogenesis [52,53]. After the wall has passed the 
region we are discussing is in the true (broken phase) vacuum with v(T) x 0, 
so it is important that, in this phase, the sphaleron-induced, baryon-number non- 
conserving processes are turned off by the Boltzmann suppression, so that any 
baryon asymmetry produced during the non-equilibrium era is frozen in. The 
condition for this to happen is [50] 


v(Te) 
T 


21 (4.225) 
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This is called the ‘sphaleron washout condition’. In non-local baryogenesis, CP- 
violating interactions of the particles with the bubble wail produce an asymmetry 
in a quantum number other than the baryon number and the resulting particles 
carry this asymmetry into the unbroken phase, away from the wall. Then baryon- 
number non-conserving effects convert this asymmetry into a baryon asymmetry, 
(some of) which is frozen in the broken phase after the bubble wall has passed. 
If the speed of the wail is greater than the sound speed in the plasma, the former 
process dominates [54]; otherwise the latter does but, in general, both may occur 
and the total baryon asymmetry is the sum of that generated by the two processes. 

We have already noted that the phase transition is (weakly) first order in 
electroweak theory and that at the phase transition (2.61) gives 


2CT, 
bc = w(T)- 3 (4.226) 
5o (Ta 2C 
Vlie 
=— <9. 4.2 
7 3 £017 (4.227) 


which does not satisfy (4.225). In fact, requiring that (4.225) is satisfied would 
require 
my € 41 GeV (4.228) 


in clear contradiction to the current lower bound (4.195). 

The perturbative calculations of the corrections to the effective potential are 
not a priori reliable, because of the so called ‘infrared problem’ that afflicts finite- 
temperature field theory [55, 56). It derives from the existence of an expansion 
parameter of the form 

2 2 

€= =i ~ gr when m & T (4.229) 
where m is some bosonic mass appearing in the propagators. Then light modes, 
those with m < gT, interacting with the Higgs are a problem that should be 
treated non-perturbatively. The direct method of carrying out a four-dimensional 
finite temperature lattice simulation is difficult because the weak coupling entails 
the existence of multiple length scales which are difficult to fit simultaneously on 
a finite lattice. Also, in practice, chiral fermions cannot be handled efficiently. 
However, all of these problems can be overcome by using a finite-temperature 
effective field theory [57-59], obtained by integrating out (perturbatively) all non- 
zero Matsubara modes, which includes, in particular, all fermions. The resulting 
effective theory is then three-dimensional and involves only the surviving infrared 
modes, the Higgs and the spatial components of the SU(2) and U(1) gauge fields. 
This theory is ideally suited for lattice simulations. It is found that, in the m 4- 
T. plane, there is a line of first-order phase transitions that end at a critical point 
after which there is only a crossover transition. The endpoint is known to high 
precision and is at 


myc = 72.3 GeV — T. = 109.2 GeV. (4.230) 


132 Baryogenesis 
Further, the condition (4.225) is satisfied only for 


my € 10 GeV. (4.231) 


Thus, the more accurate calculation of the phase transition shows that the 
current bound (4.195) is far from allowing a first-order phase transition, let alone 
baryogenesis. 

This analysis also indicates how the situation might be improved. The 
strength of the phase transition, as measured by v(T-)/ Te, can be increased by 
substantially decreasing the effective (three-dimensional) scalar self-coupling A3, 
as is suggested by (4.227). This requires a new, non-perturbative degree of 
freedom and this happens when there are extra scalar degrees of freedom, as 
occurs naturally in the supersymmetric version of electroweak theory to which 
we now turn. 

In summary, the standard model electroweak theory does not explain the 
observed baryon asymmetry for two reasons: (i) because there is insufficient CP- 
violation; and (ii) because the phase transition is too weakly first order (in the 
sense described earlier) to suppress the erasure of any baryon number asymmetry 
produced in the symmetric phase. 


4.11 Supersymmetric electroweak baryogenesis 


We have already noted that the extra matter contained in the minimal 
supersymmetric standard model (MSSM) might allow the alleviation of 
the problematic features of the non-supersymmetric theory that preclude 
baryogenesis at the level observed in nature. Supersymmetry entails the existence 
of new fermions. In particular, there are charginos and neutralinos, mass 
eigenstates that are generically superpositions of the charged or neutral weak 
gauginos and Higgsinos. There are also new bosons and we shall be specifically 
concerned with top squarks. Diagonalization of the mass matrices of all of these 
states generally leads to new sources of CP-violation. As we shall see, it is the 
existence of new particles (and thereby of new sources of CP-violation), rather 
than the supersymmetry itself, that might allow the MSSM to explain the observed 
baryon asymmetry. 

Now consider an expanding bubble of the broken phase, with the bubble 
wall propagating through the hot plasma into the symmetric phase, perturbing 
the (quasi-)particle distributions from equilibrium. Inside the bubble, baryon- 
number non-conservation is small, because of exponential suppression by the 
sphaleron's Boltzmann factor, provided that the sphaleron washout condition 
(4.225) is satisfied. Effectively, baryon number is conserved inside the bubble. 
However, outside the bubble, anomalous baryon-number non-conservation is 
rapid. One way to see how baryogenesis occurs [60] is to think of the wall of 
the expanding bubble feeling a ‘wind’ of particles in the symmetric phase. These 
particles may pass through the wall into the broken phase or be reflected back 
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into the symmetric phase region. The latter will interact and be slowed by other 
particles approaching the wall, before eventually passing through the wall. Thus, 
an accumulation of particles develops in front of the bubble wall. As a result of the 
CP violation, there is a non-zero difference between the transmission coefficients 
of these particles and their antiparticles across the wall of the bubble and a similar 
difference between the reflection coefficients. These differences, in turn, generate 
local source terms for the net number densities associated with the particles 
and, in particular, the Higgs number and axial top number, which appear in the 
coupled Boltzmann equations, tending to pull the system away from equilibrium. 
These particles are chosen because they participate in particle-number-changing 
transitions in the wall that are fast compared with relevant time scales but they 
carry charges that are approximately conserved in the symmetric phase. Transport 
effects then generate a local excess (or deficit) of left-chira] charginos (say) over 
their antiparticles ahead of the advancing bubble wall. Unsuppressed baryon- 
number non-conservation in the symmetric phase then converts these densities 
into a net baryon asymmetry, which is frozen as the bubble wall sweeps through, 
provided that the sphaleron washout condition is satisfied. 

All calculations [60-64], of ng in the symmetric phase are done using 
coupled diffusion equations for the relevant number densities, which include 
contributions arising from source currents generated at the bubble wall, scattering 
processes involving the top quark Yukawa coupling, as well as Higgs-number and 
axial-top-number-violating processes in the bubble wall and broken phase. Higgs- 
number and quark-number diffusion terms are also included. There is general 
agreement on the equations to be used (see (4.232)) but little on how to determine 
the source currents. 

We have also noted that the MSSM affords new mechanisms for satisfying 
the washout condition (4.225) that are unavailable to the standard model. 
Specifically, the possible existence of a light SU (2) scalar top squark fr (a ‘stop’) 
that interacts strongly with the Higgs field might drive the necessary reduction 
in the effective three-dimensional scalar self-coupling 43 needed to generate a 
sufficiently strong first-order phase transition with a Higgs mass satisfying the 
current bounds [65]. Any such scalar gives a negative contribution to A3 at 
one-loop level but a light left stop tz is inconsistent with electroweak precision 
measurements. Numerical calculations [65] have confirmed two-loop estimates 
and shown that these are even somewhat conservative. The conclusion is that there 
are parts of the MSSM parameter space not excluded by experiment where the 
electroweak phase transition is strong enough to allow baryogenesis. However, 
besides needing a light stop m;, S m, there must either be a much heavier stop 
m;, 2 10m; or else two independent light Higgs particles ma,4 S; 120 GeV. (h is 
the scalar Higgs and A the pseudoscalar.) 

The next question then is whether in this region of parameter space the 
additional sources of CP-violation in the MSSM can generate source terms for 
the various particle densities that are strong enough to induce sufficient baryon 
asymmetry in the symmetric phase, which is then frozen in as the bubble wall 
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passes through. The most direct method of baryogenesis would, of course, be to 
use the left-chiral quarks themselves, since any CP-violating contributions to their 
distributions would directly bias the sphaleron interactions and thereby generate 
a baryon asymmetry. However, in the MSSM (but not in all two-Higgs doublet 
models) the Higgs potential is real at tree level and CP-violating contributions 
to quark masses arise only at one-loop order. Moreover, such contributions are 
potentially suppressed by the GIM mechanism, as in the standard model. We are, 
therefore, forced to consider CP-violating perturbations in other particle densities. 
Specifically, we consider squarks, which couple to quarks strongly via the strong 
supergauge interactions, and charginos, which couple strongly to third-generation 
quarks via Yukawa interactions. Neutralinos also contribute but their coupling to 
fermions is weaker than that of the charginos, so the transport of any asymmetry to 
the quark sector is much less efficient and we neglect them. The coupled diffusion 
equations that control the densities with CP-violating sources have the general 
form ; ; 

Dib, + v,£; +T: + 5j + °°) = Si (4.232) 


where & = yi /T with u; the chemical potential of the ith species, D; is a 
diffusion constant, vy is the velocity of the bubble wall, T; is the inelastic rate 
converting species i into other species j,..., and S; is the CP-violating source 
term created at the bubble wall; the prime denotes differentiation with respect 
to the z-direction in which the bubble wall propagates. Different approaches 
have been proposed for calculating the source terms [31, 63, 66], and it is unclear 
whether they agree. We shall use the classical force method [63, 64, 67, 68], in 
which the particles move in the plasma under the influence of a classical force 
exerted on them by the spatially varying Higgs field. Because of CP-violation, 
particles and antiparticles experience (slightly) different forces. The source terms 
in the diffusion equations are proportional to the thermal average of this CP- 
violating component of the force. 

For illustrative purposes, we shall follow the treatment of Cline et al [69]. 
The set of coupled diffusion equations can be simplified considerably by taking 
account of the hierarchy of inelastic reaction rates T; that change the particle 
species i into other species j,.... The electroweak sphaleron rate Ts, of order 
o T (see (4.205)), is the slowest and can be ignored until we are ready to compute 
the actual baryon asymmetry. In contrast, the various gauge interaction rates, of 
order o4 T, are fast and can be taken to be in equilibrium on the time scale for 
particles to diffuse in front of the bubble wall: 


Di 
aT > =>. (4.233) 
Vy 
Then, in particular, the chemical potentials of the weak bosons are zero, so the 
chemical potentials of quarks in the same doublet are equal: 


En = $b, = E. (4.234) 
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Similarly, the assumption that supergauge interactions are in equilibrium implies 
that 


£p — Ej (4.235) 


for any particle p and associated sparticle p. Intermediate between these 
interaction rates are those of various inelastic processes, including those 
associated with the interaction Lagrangian and 'strong sphaleron' processes. We 
shall see in the following chapter that, although baryon number is conserved by 
QCD, the axial baryon-number current 


J£) 2 y 'dyuysq (4.236) 
q 


is anomalous and there are strong sphaleron solutions, analogous to the 
electroweak sphalerons, that connect different topological sectors. Their 
contribution to the coupled diffusion equations conveys the CP-violation to 
all quark flavours. The rate Tss of such processes is of order a3T and 
comparable with those arising from the top Yukawa coupling, so they must be 
included. Although neither of the foregoing assumptions is a particularly good 
approximation, it is believed, or at least hoped, that they lead to multiplicative 
errors of at most of order unity in the predicted baryon asymmetry. 

In the analysis of [69], the sources S; of the CP-violating asymmetry arise 
solely from the chargino sector. The squark sector is ineffective because, for 
bosons, the CP-violating source arises only at second order in the gradient 
expansion of the CP-violating mass. Charginos are mass eigenstates arising from 
Higgsino-Wino mixing. However, any asymmetry from the Wino component 
can only be transported to a chiral asymmetry in quarks and squarks via mixing 
effects, whereas a Higgsino asymmetry is transported directly via strong Yukawa 
interactions. Thus, the set of diffusion equations to be considered can be 
simplified by neglecting chargino mixing and dropping any Wino contributions. 
With the foregoing approximations, the number of coupled diffusion equations 
is reduced to those for the two Higgsino densities §; and &;,, which have CP- 
violating sources Sj, and Sin plus those for the third quark generation doublet 
£g, and the right-chiral top £g, and those for first- and second-generation doublets 
&4,. and right-chiral singlets £g, which are coupled to the first four equations 
only by strong-sphaleron interactions. The equations may be solved numerically 
and the last step is to calculate the baryon asymmetry induced by weak sphaleron 
effects on the CP-violating asymmetries £;. 

Since the weak sphaleron derives entirely from the SU (2); component of 
the electroweak gauge group, the baryon asymmetry results from left-chiral quark 
and lepton asymmetries in front of the bubble wall. The latter are essentially zero 
in this approach and the former enter the baryon-number-violating rate equation 


dag 3 


ng 
SE = Tp (ta - Az) (4.237) 
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via 

Sq. = 35g, + Eq + Eq) (4.238) 
because each doublet occurs in three colours. Here Pp is the weak-sphaleron 
baryon-number non-conserving rate (4.205) or (4.208). The second term on the 
right-hand side is the Boltzmann term by which the baryon number would relax 
to zero if the sphaleron processes had time to equilibriate in front of the bubble 
wall. The ng here is, therefore, related to the quark and lepton asymmetries, q 
and uz, that result from equilibriating all flavour-changing interactions that are 
faster than the sphaleron rate lp in the symmetric phase. Thus, A is given by 


n 
Azi -9u, Y ut (4.239) 
t 


since each sphaleron creates nine quarks and three leptons. All quarks have the 
same chemical potential, because of efficient mixing, but lepton mixing might be 
weak. The calculation of these chemical potentials depends on which interactions 
equilibriate on the relevant timescale. The asymmetry n4 for any particle species 
a is given by 

Ha T? 

6 

where xa = 1,2, respectively, when a is a fermion or boson. In electroweak 
baryogenesis, the relevant time scale is I'; ' and, on this scale, both chiralities of 
all six quark flavours in three colours do equilibriate and we include a number 
Neq of light squarks. Thus, from the quarks and squarks, we have 


(4.240) 


na = na — hà = Ka 


] 
ng = ing nsg)- "A6 x 3x 242 x 3M ]T? (4241) 
so that ; 
ng Nsq 7 
=—>{1+—] . . 
HQ = 2p ( + 6 ) (4.242) 
Similarly, since only the left-chiral leptons equilibriate, but not the right, 
ne = 4T? (4.243) 
and 
ne ng 
Yo me =3) m = 357° (4.244) 
t t 
Thus, i 
9 Nsq \ 
A-5 ( + J +3. (4.245) 


The solution of (4.237) is found by transforming to the wall frame in which 
ð; — —Vy9,. Then 
_ 3T sph 


oo 
-bz 
np Í dz&,e (4.246) 
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where 


(2v TÀ 


Consequently, the quark asymmetry £,, , found from solving the coupled diffusion 
equations in terms of the Higgsino sources Sj, (2) and Sj, (2), determines the 
baryon asymmetry ng and, hence, the baryon-to-photon and baryon-to-entropy 
ratios ng = ng/ny = Tnp/s. 

The conclusion is that the MSSM can explain the observed value (4.18) but 
several independent parameters must be optimally tuned to do so. First, the CP- 
violating phase in the chargino mass matrix must be close to maximal. In order 
to accommodate experimental data on electric dipole moments, especially that 
of mercury, this requires that the lower generation squarks must have masses of 
order 10 TeV. Actually this is necessary to maximize the chiral quark asymmetry. 
Also m;, ~ 10 TeV is required to give sufficiently large radiative corrections to 
mg, given the already noted need for a light fr (m;, S mr) to satisfy the washout 
condition. In addition, tan $ = v,/v4 € 3 is required, the wall velocity vy, must 
be close to its optimal value of 0.02 and the walls should be as thin as they can be 
for the validity of the classical force method, about 6/T. (Vu, a are, respectively, 
the VEVs of the (two) Higgs doublets ^, in (4.59), used to give masses to the 
uplike quarks, and ^2 in (4.60), used to give masses to the downlike quarks.) 
Similar conclusions have been reached by Huber and Schmidt [70]. 

Although the MSSM can explain the observed baryon asymmetry, it is 
evidently not generic. The region of parameter space in which it does so is 
very constrained, and might well be excluded by future experiments that set 
new bounds on sparticle masses, for example. We therefore comment briefly on 
alternatives that have been proposed but which, however, have not been as fully 
studied as the other methods we have described. 


b (4.247) 


4.12 Affleck-Dine baryogenesis 


The discussion of baryogenesis in the previous section made hardly any use 
of the supersymmetry of the MSSM. Rather, the MSSM supplied new fields 
that strengthened the first-order phase transition and which also developed the 
chiral asymmetries that were subsequently converted to a baryon asymmetry. 
In contrast, the Affleck-Dine mechanism [71] uses a generic feature of any 
supersymmetric theory, namely the existence of ‘flat’ directions, to generate 
a large VEV for a field carrying non-zero B — L in the early universe; at 
temperatures higher than the electroweak phase transition, we have already 
noted that weak sphaleron processes are in equilibrium, so that any B + L 
asymmetry is erased. In fact, perturbative baryon-number conservation in the 
MSSM is achieved by imposing a discrete R-symmetry that has the effect of 
excluding certain dimension-four operators from the superpotential that would 
otherwise explicitly generate baryon-number non-conservation, see [11] for 
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example. Even so, there remains the possibility of dimension-five, and higher, 
non-renormalizable operators, whose effects are suppressed by at least one 
power of a (hopefully) superheavy scale M, e.g. the Planck mass mp. The 
alternative approach to baryogenesis, proposed by Affleck-Dine utilizes both of 
these features. 

First, recall that in a supersymmetric theory there are scalar fields with 
non-zero baryon or lepton number. Before supersymmetry breaking, there are 
generally many (D- and F-) flat directions. These are directions (in field 
configuration space) along which the potential is a constant (i.e. ‘flat’). These 
include directions that allow gauge invariant combinations of squark and/or 
slepton fields to develop non-zero VEVs. For example [72], the MSSM 
superpotential (see [11]) 


W = A, QHy,u^ + AaQ Had* + A, L Hae (4.248) 


has an F-flat direction parametrized by the complex field ¢ as follows: 


q 0 id 
'-($) n-(5) d3 =¢ (4.249) 


with all other fields zero; subscripts label the generations, and superscripts are 
colour labels. The auxiliary F-terms (Fp = 9W/d®, as in section 2.7) of all 
chiral superfields «P vanish in this direction, and the fact that $ is complex shows 
that there is global U (1) symmetry associated with it. This direction is also D- 
flat. In other words, the D-terms (D^ = ^4 tr, as in section 2.7) for all 
three gauge groups also vanish. It follows that the scalar potential 


V=5) 8 DPD? e FF. (4.250) 
i > 


is zero and, therefore, flat in this direction (the summed index a runs over the 
adjoint representation of the corresponding group.) Thus, the scalar particle 
associated with the field $ is massless. Fields such as this, associated with flat 
directions, are called ‘moduli’ fields and the massless particles associated with 
them raise cosmological questions that we shall discuss later. In our example, the 
combination of non-zero fields associated with the flat direction 


X = Qt Lid (4.251) 


is gauge invariant and has B — L = —1. In general, the gauge-invariant 
combination X is proportional to a power of the field parametrizing the flat 
direction: 

X xg” (4.252) 


In our example, m = 3. As detailed later, various effects lift the flatness and may 
allow $ to develop a VEV. If these VEVs are large, the subsequent evolution of 
the universe can develop a substantial baryon asymmetry. 
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To see how, recall that the superpotential W is expected to have higher-order, 
non-renormalizable terms. These must be gauge and R-parity invariant. Thus, 
focusing on a single flat direction, in general 


X "n 
Wor = x* o" (4.253) 


M^-3 = nM"? 
where n — mk; k iseven if X has odd R-parity, as in the previous example, and M 
is some large mass. This gives F-terms that are non-zero in the specifed direction. 
Note, however, that although the superpotential Wnr does not conserve U (1)p. r, 
the scalar potential V derived from it does: 


lA? 


Vor = M2n-6 


iei? (4.254) 


since it is proportional to ($*9)-! . 

We shall see later, in chapter 7, that there is good reason for believing that 
in the early universe there was a period of 'inflation', during which the Hubble 
constant H was approximately constant. The second, and most important, point 
is that global supersymmetry is necessarily broken during inflation. Since the 


vacuum energy 


3 
(V) = a; me (4.255) 


is non-zero, there must be non-zero F and/or D components for some matter 
fields, and a soft potential develops. This includes a mass term for the (erstwhile 
massless) moduli field $, of order H, the (instantaneous) Hubble constant [72]. 
In fact, H ~ 1027/5 GeV during inflation, as can be seen from (7.41) with 
V(@) given by (7.115) in order to explain the observed cosmic microwave 
background. A mass term too conserves B — L. The supersymmetry breaking 
also induces (soft) A-terms in the scalar potential. These have the same form as 
the superpotential W, so, in our example, they have the form 


A 
Va = wat + h.c. (4.256) 


Like the mass term, the scale of A is of order the current Hubble constant H. 
Note that such terms do not conserve B — L and this is the source of the (B — L)- 
violation necessary to generate a net B — L in the evolution of the flat direction. 
The Sakharov criteria, necessary for baryogenesis, also require CP-violation and 
we shall see later that a CP-violating phase difference between this A-term and 
that arising from the hidden-sector supersymmetry breaking is essential to getting 
a non-zero baryon asymmetry at the end. 
The equation of motion for the field ¢ is 


" .. OV 
o+3HO+ 4 = 0 (4.257) 
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where V is the potential obtained by combining the contributions from Vpr in 
(4.254), V4 in (4.256) and the mass term for $. It has the form 


aXAH, o" 107? 
nM" M?n-6 


where H; is the (approximately constant) value of the Hubble parameter during 
inflation and a, c are constants of O(1). This, of course, is just the equation of a 
damped oscillator and the important point is that, during inflation, it is close to 
being critically damped. If c « 0, which corresponds to positive mass-squared for 
$, V has a minimum at $ = O. In this case, the average value of the field evolves 
exponentially to ¢ = O and the large value at the end of inflation needed to get 
a baryon asymmetry is not achieved [72]. However, if c > 0, which, in general, 
requires non-minimal Kahler terms, V has a single minimum at ġo, where 


1/(n-2) 
H M3 
idol = (e) (4.259) 


= -cH2ef + ( +he.) + (Al (4.258) 


A 


with B a numerical constant which depends on a, c, n. Thus, |ġo] is parametrically 
between H; and M, which can easily be large. In the angular direction, the 
potential varies as cos(arga + arg A + n arg) and has n degenerate minima. 
Further, again because of the near critical damping, the field evolves rapidly to 
one of these minima, provided c is not too small [72]. Thus, at the end of inflation, 
the average value of the field has a large value with a well-defined phase, which 
is constant over scales large compared with the horizon. 

This sets the boundary condition for the next era. After inflation the universe 
enters a matter-dominated era in which the Hubble constant is explicitly time- 
dependent: 

2 

EE" 

(see section 1.4). The equation of motion is still given by (4.257), with V of 

the form (4.258) but with H now given by (4.260). As t increases, H decreases, 

so that the instantaneous minimum of V also decreases. Solving the equation of 
motion reveals that $ tracks just behind this decreasing minimum. 

This evolution continues until H ~ m3;2 ~ 1 TeV, where m3/2 is the 
gravitino mass. At that point, the soft supersymmetry-breaking terms from the 
hidden sector become comparable with those arising from inflation and, at later 
times, dominate the evolution. The hidden-sector terms contribute a positive 
mass-squared term for ¢ as well as an A-term, both having scales determined 
by 73/2. Thus, the additional contribution to the potential V has the form 


(4.260) 


Am3/24$" 
Vhs = m2ler? + (mee + he.) (4.261) 


where mg ~ m3;2 and A = O(1). Consequently the equation of motion (4.257) 
becomes underdamped as H decreases below m3/?. Two important effects now 
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come in to play. First, the positive mass-squared term dominates the inflationary 
contribution, so that @ begins to oscillate undamped about ¢ = O, with initial 
condition $ = P(t) at t ~ m3, P The oscillation of $ is the coherent condensate 
with number density 


ng = P = meld? (4.262) 


where pg is the energy density in the condensate. Second, when the (CP-violating 
and (B — L)-violating) hidden-sector A-term dominates the inflationary term, the 
potential in the angular direction varies as cos(arg A + arg A + narg@). Thus, 
if arg A Æ arga, a non-zero ‘torque’ is created and a non-zero 6 develops. This 
is precisely what is needed to create a baryon/lepton asymmetry. The number 
density for the U (1) g—1 charge of the condensate is 


ng—L = il" dob — $899") = 219176. (4.263) 


The equation of motion has been integrated numerically by Dine et al [72], who 
find that, at late times (t >> m3/2), the ratio ng; /n$ generically evolves to a 
constant of order unity. At late times, the potential is dominated by the (positive) 
mass term which, of course, conserves B — L. Thus, the B — L created during 
the time when H ^ m3,2 is conserved. 

It remains only to convert this O(1) ratio to the physically relevant baryon 
to entropy ratio ng; /s. When H ~ m3/2, the energy density of the condensate 
pe ~ m lel? is much smaller than the energy density p; associated with the 


coherent oscillations of the inflaton, p; ~ EH 2m3. Using (4.259), we see that 


2/(n-2) 
n-3 
P$ .., mo . (4.264) 
Pl dum’ 


This ratio remains approximately constant until the inflaton decays at some time 
when H < m3;2. Provided that the inflaton decays dominate, the entropy density 
is given by 

~ PL 


~ 4.265 
"Tm ( ) 
where Tp is the reheat temperature after inflaton decay. Thus 
- -LT 
"B-L _ B-L ÍR P (4.266) 


S ng mọ Pi 


The ratio (4.264) is very sensitive to n. Forn > 4, M ~ mp anda reasonable Tg, 
the ratio n g—1/s is generally too large. For example, to get the observed value of 
npg—L/s ~ 107! with n = 6 requires Tg to be of order the electroweak scale. In 
contrast, n — 4 gives 


"B-L — |q-10 Tr 1073M 
s t0 (x Lx) ( Am p (4.267) 
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which gives an acceptable ratio for a reasonable range of the parameters. 

As long as the ¢ condensate decays via (B — L)-conserving decay processes 
after the inflaton, the previous estimate of the ratio is insensitive to the details of 
the decay. However, even though most of the inflaton energy is not converted to 
radiation until H = Hr < m3/2, the condensate might also decay via thermal 
scattering as soon as $ is small enough, which is when g@ < T, where g is the 
gauge or Yukawa coupling constant. This must be unimportant when H ~ m3,2. 
However [72], the value of $/T is rather sensitive to the integer n. For n — 6, the 
required condition is comfortably satisfied but the n — 4 case is borderline. 

The only flat direction having B — L non-zero and having n — 4 corresponds 
to 


= + (LH, 
W= M (LH,) (4.268) 


which might arise directly at the string scale or be generated from a GUT by 
integrating out a heavy standard-model singlet field N with coupling gL H,N. At 
low energies, this operator generates neutrino masses 

A 


~ 4. 
m ~ MU (4.269) 


where v is the Higgs VEV. Then 


NB-L —10 Tg 1078eV 
-1 — — . 
s 9 (s Es) ( my (4.270) 


where m, is the lightest neutrino mass. Taking Tg < 10? GeV then requires at 
least one neutrino to be lighter than about 107? eV, with an even stronger bound 
of 107? eV if the constraint Tg « 10° GeV is enforced to ensure that thermal 
scattering is unimportant when H ~ m3;2. Interestingly, in this case, non- 
minimal Kahler terms are not needed to ensure a negative mass-squared for 9. 
This is because such terms can arise for the Higgs field via radiative corrections. 


4.13 Exercises 


1. Show that the nucleon-antinucleon annihilation rate falls below the 
expansion rate when the temperature T ~ 20 MeV. 

2. Show that in a time-reversal invariant theory baryon number is conserved. 

3. Verify that for the SU (5) GUT N, has the value given in equation (4.41), 
and that for the supersymetric theory, it has the value given in (4.42). 

4. Show that the superheavy gauge bosons in the SU (5) GUT have the decay 
modes given in equation (4.67) and that the colour-triplet Higgs has the 
decay modes given in (4.71). 

5. Verify that in the SU(5) GUT there are no two-loop contributions to 
T (H3) — (H3) and that the three-loop contribution given in equation (4.81) 
does arise in the theory. 
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6. Verify equation (4.86). 


7. If the SO(10) GUT symmetry-breaking Higgs transforms as the 16- 
dimensional representation, show that the gauge group breaks to SU (5) x 
U (1), and if the Higgs transforms as the 54-dimensioinal representation, then 


the gauge group breaks to G422 given in equation (4.97). 
8. Verify equations (4.130) and (4.131). 
Verify equation (4.136). 


10. Show that in the standard model the communication of CP-violation to the 
baryon-number non-conserving processes, via the phase in the CKM matrix, 
arises first in 12th order perturbation theory, as given in equation (4.210). 


Verify equation (4.214). 


11. Verify that decoupling of the baryon-number non-conserving interactions 


occurs at a temperature T* given by equation (4.223). 


12. Verify equation (4.240) relating the asymmetry n 4 to the chemical potential 


Ha. 


13. Show that in the direction (4.249) all of the F-terms derived from the 


superpotential (4.248) are zero. 
14. Show that in the direction (4.249) all of the D-terms are also zero. 


15. Verify that the potential (4.258) has a minimum of the form given in (4.259). 


16. Verify (4.264). 


17. Show that in the MSSM the subspace of field directions in which all D-terms 


vanish is 37-dimensional [72]. 


4.14 General references 


e  Riotto A 1998 Trieste 1998, High Energy Physics and Cosmology pp 326- 


436, arXiv:hep-ph/9807454 
e  Trodden M 1999 Rev. Mod. Phys. 71 1463, arXiv:hep-ph/9803479 


e Cline J M, Joyce M and Kainulainen K 2000 JHEP 0007 018, arXiv:hep- 


ph/0006119 
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Chapter 5 


Relic neutrinos and axions 


5.1 Introduction 


We saw in chapter | that, for much of the time, the constituents of the early 
universe were in approximate thermal equilibrium. This is because the rates for 
the interactions of these constituents were large compared with the expansion 
rate H. However, this thermal equilibrium was not maintained all of the time. 
If it were, the current state of the universe would be entirely determined by its 
temperature and we noted in the previous chapter the huge disparity between, 
for example, the equilibrium baryon abundance and that actually observed. 
Departures from equilibrium are, therefore, extremely important in determining 
the abundance of the relics that can be observed today. 
The equilibrium number density 7 x eq of a species X is given by 


ey eee 
meer ny f: P EMIT +1 (5.1) 


where g is the number of internal degrees of freedom, E(p) = 4/(1 p? +m?) and 
+1 relates to fermions and — 1 to bosons. In the relativistic limit T >> my, this 
gives for bosons 


3 
I LESE (5.2) 
and for fermions 
3t) _3 
where ¢(3) = 1.202 06. A similar calculation shows that both the energy density 
__8 3 I 
and the pressure 
2 
S m l 
- m € 5.5 
PXe = gy f: P 3E(p) EPIT x1 po 
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scale as T in the relativistic limit and the total (relativistic) energy density is 
2 
n 
p =3p = gs, TT* (5.6) 


where 4 4 
Ti 7 Ti 
BT = > 8i (2) t 8 2 gi (2) (5.7) 
ns fermions 
and we are allowing for the possibility that different particle species i may be at 
different temperatures 7;. 
In thermal equilibrium, the entropy per comoving volume 
S PtP 


Ss =Z — => 


V T 


is dominated by the contribution of relativistic particles and, to a good 
approximation, it is given by 


(5.8) 


2n? 
s= 75 ges, TT? (5.9) 
where 3 3 
TY 7 Ti 
Best = s (2) +3 Do ai (2) (5.10) 
bosons fermions 


Comparing (5.7) and (5.10), we see that when T; = T, so that all particle species 
are at the same temperature, g.,r = £«s,T = Na, with N, defined in (1.104). 
However, in general, they differ. 

Since the entropy per comoving volume is conserved, it is useful to measure 
the abundance of a species X by scaling its number density with the entropy 


density. We therefore define 
nx 


Yy = PE (5.11) 
Then, using (5.2),(5.3) and (5.9), the equilibrium abundance in the relativistic 
limit is 45t (3) 
geff Seff 
= = 0.278 5.12 
Ken 77^ gar B«S.T (5.12) 


where for bosons ger = g, and for fermions gett = 3g /4. 
All cosmological relics contribute to the current total energy density oo of 
the universe, and it is customary to scale these densities with the critical density 


2 


3H 2 -3 
Pc = = 10.54h* keV cm (5.13) 
8xGn 


where Ho = 100h km s^! Mpc™' is the present Hubble constant and 


h = 0.714308. (5.14) 
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The dimensionless measure of the total energy density is then defined by 
Qo = — (5.15) 


and, similarly, for a relic species X whose current energy density is px,0, we 
define 
— PX,0 


Pc 


Qx (5.16) 


Thus, 
$ 2x0 = Qo (5.17) 
X 


and the energy density for any one relic species must be less than the total energy 
density, so 


2x0 < Qo (5.18) 


where data from the latest microwave anisotropy probe (WMAP) [1] give 
Qo = 1.02 + 0.02. (5.19) 


The current energy density of a species X is given in terms of the current 
abundance by 


Px,0 =nx,omx = Yx osomx (5.20) 


so provided we can calculate the current abundance Y x o, and the current entropy 
density so, a bound on the mass m y of any relic species may be obtained: 


[0597 
so¥x.o 


A stronger bound may be obtained by replacing 29 by Rm where the latter derives 
from the total matter content in the universe. The WMAP analysis gives 


2 +0.008 
mh? = 0.135+9-008 (5.22) 


with A given by (5.14). 

In the next section, we shall apply the foregoing considerations to neutrinos, 
in order to see what can be inferred about their masses. In section 5.3 we shall 
attempt a similar analysis for ‘axions’, hypothetical particles that are required to 
exist if the ‘strong CP problem’ of the standard SU(3) x SU(2) x U(1) model of 
particle physics is solved by the 'Peccei-Quinn' mechanism, currently the only 
known solution of this problem. Axions must be very light, like the neutrinos. 
If they have survived until the present, their mass too is strongly constrained by 
various astrophysical and cosmological data. 
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5.2 Relic neutrinos 


Uniquely among elementary particles, neutrinos participate only in weak (and 
gravitational) interactions. In the early universe, scattering processes, such as 
ve < ve, and annihilation processes, such as vy <> ee, kept the neutrinos in 
thermal equilibrium. The total cross section for such processes is o ~ GiT?, 
just on dimensional grounds, since the weak (Fermi) coupling constant Gr ~ 
1075m;? has dimensions [M 72]. Since from (5.3) the relativistic number density 
Ry,eq i$ proportional to T, the total interaction rate Tin ~ a Vry eq ~ G2T5. 
When this is large compared with the Hubble expansion rate 


H=,/ 3 =y 45 T (5.23) 


there is thermal equilibrium. However, when T ~ 1 MeV, the two rates are 
comparable: Tin, ~ H. Below this temperature, the Hubble expansion dominates 
and thermal equilibrium is not maintained. The neutrinos are, therefore, 
decoupled or ‘frozen out’. Their abundance is frozen at the value obtained at 
the decoupling temperature Tgec ~ 1 MeV. Thus, the present abundance 


Y, = Yv.eq. Ta. (5.24) 


where, using (5.12), 


ny eff 

Yy cq.Tice = — = 0.278 ———. (5.25) 
Pea dee 5 8S ,Tdec 

For a single (left-)chiral neutrino species ger = 3/2 (including the antineutrino) 

and, since Tyec ~ 1 MeV, 


8*5 T = 2+ £4 +3x2)= 2 (5.26) 


keeping only the electron and three families of chiral neutrinos as relativistic at 
this temperature. 
In order to determine the bound (5.21), we first need to calculate the present 


entropy density 
2 


2x 3 
$0 — "4s 855 mTo (5.27) 


where g«5,7, is given by (5.10). At T = To, the (relativistic) species contributing 
to so are the photons, having 2 — 2, and the three families of neutrinos, also with 
g = 2 (including the antineutrinos). Thus, 


7 T, X 
EsSTy—72*z2x3x2 (z) . (5.28) 
8 To 


The temperature of the neutrinos T, differs from Tọ because after neutrino 
decoupling, when the temperature falls below T = m, ~ 0.5 MeV, electrons 
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and positrons annihilate via e*e^ — yy and the entropy in the e* pairs is 
transferred to the photons but not to the neutrinos which are already decoupled. 
For Tac > T > me, the species in thermal equilibrium with the photons 
are the photons (g = 2) and the electrons and positrons (g = 4), so that 
ge = 2+ a4 = 4, When T < me, only the photons are in equilibrium, so 
that g, = 2. Conservation of entropy, which is proportional to g«s T, therefore 
requires that the photon temperature increases by a factor ( By 3 following the 
pairs’ annihilation, while the temperature of the neutrinos is unaffected. Thus, 


1/3 
m (4) - 
£«S.Ty = T] (5.30) 


and, with Tọ = 2.725 K, equation (5.27) gives 
so = 2889 cm ?. (5.31) 
Finally, we note that from (5.19) and (5.14), the WMAP data give 
Qoh? = 0.51 + 0.04. (5.32) 
Putting all of this together, we find that 


2. 3 
Qu Hg = —— d.n. Gy Tam, (5.33) 
where 
Q, = 22, (5.34) 
Pe 
Thus, 
my = Qysh? (94.1 eV) (5.35) 


and (5.21) gives the Cowsik-McClelland bound [2, 3] 
my « 48eV (5.36) 
or, if we impose the stronger constraint deriving from (5.22), 


my « 12.7 eV. (5.37) 


5.3 Axions 


5.3.1 Introduction: the strong CP problem and the axion solution 


We have already alluded in section 4.7 to the infinity of topologically distinct 
vacua in electroweak theory that derive from the non-trivial (third) homotopy 
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class of the electroweak SU(2) gauge group, as noted in (4.147). Since SU(2) 
is a subgroup of SU(3), similar conclusions apply to QCD and 


13(SU(3)) = Z. (5.38) 


Indeed the pure SU (3) gauge theory has the well-known ‘instanton’ solutions [4], 
which approach these vacua as |x| — oo. These have (Euclidean) action Sg 
satisfying 


87? 
E= Braal, (5.39) 
83 
Here q is the Pontryagin index and is given by 
g ^ 
—3. | dx (G G^") (5.40) 
167? 


and it counts the number of wrappings of the $?, that is the SU (2) group manifold, 
by the unitary matrix U/3(x) specifying the (pure gauge transformation) vacuum at 
infinity: GK” is the gluon field strength. See, for example, [5]. The consequence 
of this is that the true QCD vacuum, the so-called '0-vacuum', is a superposition 
of these states 


l9) = Y e" Iq) (541) 
g 


where |q) is the ‘vacuum’ with Pontryagin index q. Then, if we define Vj as the 
operator that changes the winding number by one unit, so that 


Vilg) = la +1) (5.42) 
we see that the 0-vacuum is an eigenstate of V; with eigenvalue e. This means 


that the effective Lagrangian has an additional piece (a so-called '-term") 


Lee = L+ 983 Ga Ga" (5.43) 
32:2 "" 


which is parity (P), time-reversal (T) and CP non-invariant. 
A similar additional term also arises when an axial U(1) transformation is 
performed on all of the quark fields: 
U(1)4 : q > eq. (5.44) 


The axial current iD, defined by 
iP = V ararsa 
q 


= Yasvaan — dLvuatl (5.45) 
q 
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where qr and q; are the chiral components of q, is anomalous [6] (see also 
chapter 4 [33, 34]) (see [5] for an account of this). In fact, for massless quarks, 


(5) NIR? na z 
an jf = rg, oon (5.46) 
where Np is the number of flavours (= 2NG). Thus, as in (4.126), such a 0-term 
can be removed by performing the U (1)4 transformation (5.44) with 


0 
=-—. 5.47 
"--37WD (5.47) 
With all quarks massless, the 0-term is not physical, since it can be removed 
by an (unobservable) U (1)4 transformation. However, this is not the end of the 
problem, since quarks are not massless. The contribution of the mass terms to the 
QCD Lagrangian is 


Lm = —q1i MijdRj — FriM} qL) (5.48) 


where i, j = 1,..., Ny label the quark flavours, and Mj; is the mass matrix. The 
effect of the U(1)4 transformation (5.44) on M is 


U(DA : M > eM 
Mt > e Mt, (5.49) 


Thus, if M was initially Hermitian, so that there are no yss in the mass terms, 
it is no longer so after the transformation and the transformed Lagrangian has 
reacquired the P and T non-conserving interactions which the transformation 
(5.44) with o satisfying (5.47) sought to remove. The quantity 0 defined by 


6 = 6 + 2N p arg(det M) (5.50) 


is invariant under U (1)4 transformations and parametrizes the T-violation in the 
(strong) QCD Lagrangian: 0 is the effective QCD vacuum angle in the basis 
where all quark masses are real, positive and ys free. If non-zero, it contributes 
to the neutron electric dipole moment d, and the measured upper bound on this 
requires [7] 

8 € 10710, (5.51) 


The outstanding question, then, is why 8 is so small. This is the ‘strong CP 
problem'. 

For each value of the parameter 0, we have a different QCD theory and there 
is no a priori reason why one (very small or zero) value is preferred over another. 
A possible escape from this is that 8 is the expectation value of a field 8(x), whose 
VEV is determined dynamically by an effective potential, as happens when the 
electroweak symmetry is spontaneously broken by the Higgs-doublet field. Then 
it is conceivable that, at the minimum of the effective potential, 9 = 0. 
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Before addressing the solution of the strong CP problem, we should note that 
CP-violation might also arise in electroweak theory from terms analogous to the 
QCD -term (5.43), namely 


6293 
322? 


Each of the 0-terms is a total divergence (see (4.134) and (4.136)) and so can 
only contribute surface terms to the action. However, if we consider Euclidean 
path integral configurations with finite action, the field strengths Fuy must fall 
off faster than 1/r? as r — oo, where r is the Euclidean distance. For the U(1) 
case, this requires that the vector potential B,, falls off faster than 1/r and then 
the surface integral is negligible as r — oo. The basic reason is that the @,-term 
has a trivial topological structure. We therefore drop it henceforth. In contrast, 
the 62-term is necessitated by the non-trivial topological structure (4.147) of 
SU(2). It contributes non-zero surface terms to the action, even though it is a total 
divergence (4.136). In this case, there are Euclidean path integral configurations 
with finite action in which the field strengths W,, ~ 1/r* asr — oo but 
W, ~ l/r. For these configurations, the surface terms are not negligible. 
However, we have already noted that of the four global U(1) symmetries of 
the standard model, associated with baryon number (B) and the lepton numbers 
(Le, £ = e, u, t), three, namely iB- Le, are exactly conserved, and the 
remaining one, say U (l)p, is anomalous, as noted in (4.130). Thus, in a manner 
directly analogous to that previously described for U(1)4 (when the quarks were 
massless), we may perform a U(1)g transformation of the quark fields which 
removes the 62 term. So 6» is evidently not an observable parameter and the only 
observable 6 parameter is that associated with QCD. 

This observation indicates that a possible solution of the 6 problem is to 
introduce a further U (1) symmetry, designed to allow the removal of the 0-term 
by an appropriate transformation which changes arg det M. This is the solution 
proposed by Peccei and Quinn [8] in which the symmetry group of the standard 
model is augmented by an additional global, chiral U(1) symmetry, known 
universally now as the U(1)pg symmetry. However, we cannot do this with 
just the minimal Higgs content of the standard model. This is because if we 
use the U(1) to rephase the Higgs doublet by a phase factor e, say, the down- 
type masses are rephased by e, but the up-type by e~#, so that arg(det M) 
is unaltered. Thus we must introduce additional scalars. Further, the scalars 
must not have equal and opposite U(1) charges, since otherwise the previous 
objection still applies. It follows that the U(1) cannot be the existing U(1)y 
of the standard model, so a new U(1)pg is required. Additional global U(1) 
symmetries arise quite commonly in semi-realistic compactifications of heterotic 
and type I/II string theories. Thus, their introduction to solve the 0 problem 
seems less unattractive now than when it was first proposed. Generically, at 
the minimum of the effective potential, both the local gauge symmetries and 
the global symmetry are spontaneously broken. Then, by Goldstone's theorem 


a yrapv og? pV 
Wa, Woe” + "tBu BM. (5.52) 


Lo = 
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(see [5] for a discussion), there is a scalar (Goldstone) boson having zero mass at 
the Lagrangian level. This is the axion a(x). It is associated with the phase of the 
U (1) pg transformation and, under a U(1)pg transformation parametrized by a, 
it transforms according to 
a(x a(x 
a), ae) (5.53) 
UPQ UPQ 
where upg is the VEV associated with the spontaneous breaking of the U(1) po 
global symmetry. Under the same transformation, a chiral fermion ( f) transforms 
as 


f(x) > eU? f(x) (5.54) 


where x ¢ is the Peccei-Quinn (PQ) charge of f. Then the (PQ) current associated 
with the symmetry is 


(PQ) — 


— vpgü"a +} xs fy"f (5.55) 
f 
and this is conserved at the classical level, because of the U(1)pg symmetry. 


However, because it is a chiral symmetry, the symmetry is anomalous, just as 
U (1)4 is. The anomaly has a form similar to that in (5.46): 


9(8,0) 


g? 
au j(PQ) — ¢, 83 83 Ga GM b T Wa, We" +61 T B, B^" (5.56) 


32n2 "" 


where the parameters £; (i = 1, 2, 3) are model-dependent constants determined 
by the U (1) pg charges of the (chiral) fermion states. 

The axion field a(x) appears explicitly in the Yukawa couplings of the 
fermions to the scalar fields: it is these couplings which generate fermion mass 
terms when the gauge symmetry is spontaneously broken. We now make a local 
transformation of the fermion fields: 


f(x) (5.57) 


f(x) > exp Ed 


chosen so that the axion field is removed from the Yukawa terms. Because it is a 
local transformation, the fermion kinetic terms generate (derivative) interactions 
with the axion field 


fy"i, f > fy"ib, f + si Gun f v" f (5.58) 


and because U(1)pg is anomalous (see equation (5.56)), extra non-derivative 
axion interactions are generated: 


g 
Lanom = x E s = 3G4,G" + 6525 Way wo +8155 


g? 
3202 "" I zl 


(5.59) 
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Thus, the effective eme is 


Lett = Lsm + ni + ri 7 Ou yaw" 
3202 
0 £ iW wav £i g? 51 
2+ pg ^? T6 + vag ^? 32m Bub 
+ 540)? + eet — vpo (dua)] (5.60) 
UPQ 


where j(?2)# is given in (5.55). Thus, the anomaly generates a potential for a(x) 
and it is no longer true that all values of (a) are allowed in the vacuum nor that 
the axion field is massless. In fact, in the 0-vacuum, Peccei and Quinn showed 
that ^ 

84 os; llaw) = (5.61) 


so that the T-violating QCD o-term i is cancelled. We have already noted that the 
other 8-terms may be dropped, so what remains is an effective Lagrangian in 
which the physical axion field a(x), given by 


@(x) = a(x) — (81a(x)10) (5.62) 
has interactions with the gauge field strengths and the matter fermions: 


£g 
+ Xavp 


£392 
32x?vpQ 


3riosg Pe" + 50,0) 


1 
+ gpg (Un o» — vpo (8,à)]. (5.63) 


Lest = Lsm + 5 — G(x) Go, GP" + —2572 a(x) Ws, Wa" 
o 


Effectively, the offending T-violation has been removed by replacing the 6- 
parameter by a dynamical (axion) field. So the next task is to determine the 
physical properties of the axion and their implications for experiment. 


5.3.2 Visible and invisible axion models 


The properties of the axion may be calculated using current algebra techniques 
[9-1 1, 14] or by an effective Lagrangian technique [9, 12, 13, 15]. The former give 


(5.64) 


a 


1 7 
ma ~ 0.62eV ( 0 em 


where 
fa —— (5.65) 
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fixes the strength of the axion coupling to the gluon field strength in (5.63). We 
can see roughly how this estimate arises by noting that this coupling provides an 
effective potential for the axion so that 


82 Ver 1 g a ~ 
2 € 3 a apv 
= (—“)--__3._ — (G! Gs 
Ma (Ss) Fa 32n2 pa Cure? 
4 
_ Seep 
f2 


where the last estimate is just on dimensional grounds [16]. Thus comparison 
with the current algebra result (5.64) would imply Agcp ~ 80 MeV, which is a 
bit low but in the right ballpark. Although the axion is not massless, it is clear that 
any reasonable scale of symmetry breaking will give f; >> A gcp, which implies 
a very light axion as the price for solving the strong CP problem. For example, 
fa ~ v ~ 250 GeV and (5.64) gives mg ~ 24 keV. 

The effective Lagrangian (5.63) shows that the axion will decay to two 
photons with a Lagrangian of the form 


Layy = ED - B. (5.66) 


(Without confusion, we may suppress the hat now.) This decay mode 
a — yy (5.67) 


will be dominant unless 
Ma > 2m,. (5.68) 


The strength g, may be derived from (5.63) and is given by 
_ *& 
y= 
255 


so it is completely determined by the Peccei-Quinn (PQ) charge assignments. 
Similarly, from (5.63) the axion coupling to the fermion f is 


(5.69) 


l — 
Lf = ——— (3a) > Xf, FY nay f (5.70) 
UPQ X=R,L 
where 
ay=45(ltys) forx — R,L. (5.71) 


Equivalently, L f can be written in the form 


Ly - igy TL (5.72) 
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with the strength gf given in terms of the PQ charges of the right- and left-chiral 
components of f by 
Bf =X fp —Xfr- (5.73) 


There is by now overwhelming evidence that the original ‘visible’ axion, 
characterized by 
fa^ vpo ~ v ~ 250 GeV (5.74) 


does not exist [15]. We mention briefly some of the laboratory-based experiments 
that lead to this conclusion. The coupling of the light quarks u, d to the axion 
may be expressed in terms of isoscalar and isovector combinations in an obvious 
way. The isovector part (41) determines the mixing between the axion and the x 
and, since the decay rate for pion beta-decay 


mt — nÜety, (5.75) 
is well known, the rate for the process 
nt — aetv (5.76) 


can be reliably predicted in terms of the isovector amplitude (A1). Now, if the 
mass of the axion satisfies (5.68) it decays rapidly via the process 


a —> ete^ (5.77) 


and a bound on the branching ratio for this process can be inferred from the 
measured branching ratio [17] for the process 


nt — ete etw (5.78) 


This requires that the isovector amplitude is small, 
Bal £2 x 107? (5.79) 


and this is sufficient to exclude the ‘short-lived visible axion’ models satisfying 
(5.68), since A, is predicted to be large in such models [15]. 
However, if the mass of the axion satisfies 


mg < 2m. (5.80) 


it can only decay slowly, via the process (5.67). In this case, there are strong 
experimental bounds deriving from the failure to detect axion production in 
various beam dump experiments. In such experiments, many different processes 
may produce axions and while it is difficult to calculate individual processes 
reliably, they contribute incoherently and cannot all vanish. Thus, the production 
Cross sections for the processes 


pN — aX 
eN — aX 
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and the interaction cross section for 
aN > X (5.81) 
can be confidently estimated and collectively they require [18] 
ma S 50 keV. (5.82) 


Thus, if the solution of the strong CP problem is to be found using the PQ 
mechanism, the axion must be 'invisible' in these experiments. All models which 
achieve this use an SU (3) x SU(2) x U(1) singlet scalar field o having a non- 
zero U(1)pg charge, which acquires a large VEV (vpg > v) so that the beam 
dump bound is satisfied. One way to achieve the invisibility is if the known 
quarks and leptons have zero U(1)pg charge but there exist some new quarks 
(X), presumably very heavy, having non-zero PQ charge. Such a possibility 
was proposed by Kim [19] and by Shifman et al [20] and the axion is called 
the 'KSVZ' or ‘hadronic’ axion. The coupling to the scalar field ø is given by 


LESYZ = -hX Lo Xg 4- h.c. (5.83) 


and there is no (tree-level) coupling to the leptons. Another possibility, suggested 
by Dine et al [21] and by Zhitnitskii [22], is that the known quarks and leptons do 
carry PQ charge so, as in the original model, two Higgs doublets H ? are required 
but they are coupled to the PQ field o only via a term in the Higgs potential having 
the form. 

yDFSZ — X Hir; Hao 4 h.c. (5.84) 


This was discussed in [5]. The axion in this model is called the ‘DFSZ’ or ‘GUT’ 
axion. Although differing considerably in their physical input, the models make 
similar predictions for the coupling strength g, of the axion to two photons: 


gp? = — 0.96 


gDFSZ — 0.37. 


5.3.3 Astrophysical constraints on axions 


The experimental requirement discussed earlier that axions, if they exist, must 
be 'invisible' implies that their coupling to photons, leptons and hadrons is very 
weak. This is most naturally achieved by making fa ~ vpg very large which, 
from (5.64) in turn entails ma being very small. For example, for a GUT axion, 
we might expect fa ^ upg ^ UGUT = O(10!5 GeV) and then (5.64) gives 
m, 7-107? eV. In principle, any weakly interacting particle having a mass smaller 
than typical stellar temperatures, i.c. in the keV-MeV range, can provide an 
additional mechanism for a star to cool, besides the standard neutrino emission. 
Of course, the interactions must be strong enough to ensure sufficiently copious 
production of the particle so that large amounts of energy can be carried away 
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by the new coolant but weak enough for the coolant to stream away without 
undue hindrance from too many interactions. Since stellar evolution models are 
well developed and successful in accounting for the observed stellar lifetimes, the 
axion production cross sections and, hence, the strength of its various couplings 
are constrained by the error bars on the observational data [23-27]. 
For example, in globular cluster stars, axions may be produced by the 
Compton process 
ye — ae (5.85) 


shown in figure 5.1, or by axion bremsstrahlung 
eZ > aeZ (5.86) 


shown in figure 5.2. The production cross section for both of these and, hence, 
the stellar cooling rate is proportional to Bee where, using (5.72),(5.73), (5.65) 
and (5.64), 

Me (xen — XeL)Mame 


mg n eR L KoL) MaMe 5.87 
Bee = ee po — EX(0.62 x 1016 eV?) (5.87) 


The observational data yield the constraint [23, 28] 


l8aee! S; 0.5 x 107? (5.88) 
so that 
Gen — xe) |, < 0.62 x 10-2 eV (5.89) 
£3 
which gives ma € 1072 eV as the generic constraint on DFSZ models, taking the 


unknown PQ charges to be of order unity. Of course, the mass of the hadronic 
axion is unconstrained by these data. 

The globular cluster data also constrain the axion-photon coupling, which 
enters via the Primakoff process 


yea (5.90) 


shown in figure 5.3, in which a photon is converted to an axion in the coherent 
electromagnetic field of a nucleus or an electron. The production cross section is 
proportional to Bey where, from (5.66) and (5.64), 


Qem Ma8yAem 


= gy — = —— 5.91 
Bary = By Tf, (0.62 x 1016 eV2) 691) 
and the data yield the constraint [27] 
Igayy| € 0.6 x 1079 GeV. (5.92) 


Then 
lgylma S 0.16 eV (5.93) 
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Figure 5.1. Axion production via the Compton process. 


and ma S 0.4 eV for both the DFSZ and KSVZ axions. 

Similar arguments may be applied to the cooling of neutron stars. In the 
supernova SN 1987A, thermal neutrinos transported away the binding energy of 
the newly formed neutron star in about 10 s [29, 30], in accord with theoretical 
calculations. The possibility of other mechanisms for removing the energy is, 
therefore, constrained and, for axions, the axion-nucleon coupling is constrained, 
which, in turn, constrains the mass [31,32] to satisfy 


ma € 0.01 eV. (5.94) 


5.3.4 Axions and cosmology 


If they exist, axions would be produced in the early universe and the relic axions 
have important implications for current and future observations. In principle, 
axions may be produced thermally or non-thermally and two distinct non-thermal 
mechanisms have been proposed. 

The discussion of thermal production is straightforward. At high 
temperatures, axions are created (and destroyed) by photoproduction or 
gluoproduction on quarks: 


yq e aq (5.95) 
gq € aq. 


162 Relic neutrinos and axions 


7a 
y ^ 

e ae e 

(2,A) (Z,A) 
sa 
4 
L 
Pd 
L 

e Eae e 

(Z,A) (Z,A) 


Figure 5.2. Axion bremsstrahlung. 


When the temperature drops below that of the quark-hadron phase transition 
T € Agcp ~ 175 MeV, by the pion-axion conversion process (5.90) 


aN «aN. (5.96) 
Each of these processes has an associated absorption rate 
Ds = nr (olvl)abs (5.97) 


where nr is the number density of the axion's target T = q or N, o is the 
scattering cross section, v is the relative velocity of the axion and the target T 
and (...) denotes a thermal average. If the expansion rate of the universe is slow 
compared with the total absorption rate Fabs, then we expect that these processes 
will eventually achieve thermal equilibrium with the standard (relativistic) axion 
number density given by equation (5.2) with e; = 1. However, if the axions 
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Figure 5.3. Photon-axion conversion: the Primakoff process. 


interact too weakly, the total absorption rate is too slow for them ever to reach 
thermal equilibrium and their number density freezes out with a value below nig eq. 

To quantify this, we use the abundance defined in (5.11) with the equilibrium 
value given in (5.12) with gq eg = 1. The Boltzmann equation determining the 


evolution of Y, is 
dYa 
dt = —Tabs (Ya = Ya,eq)- (5.98) 


Thus, Y, (t) always lies between its initial value and Y74: 


t 
Y,(t) — Ya eq = (Ya (0) — Ya,eq) exp ( -f labs ar). (5.99) 
0 


It is convenient to recast the integral in terms of the variable 


=—. 5.100 
x T ( ) 
In the radiation era the scale factor R(t)  t!/?, so that the Hubble rate 
1 
H => aT? ax? 5.101 
x x ax ( ) 


and the relic abundance may be written as 


E Y,(0) * Tabs(x’) / 
Y,(x) = Yaa — ( — Y. es ) exp(- f XHG) dx J (5.102) 


Below the quark-hadron phase transition, the nucleons are non-relativistic 
and have an equilibrium number density given by (5.1) in the limit T « my, 


3/2 
my \ v 
"nN—ENIIT— e. (5.103) 
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The thermally-averaged cross section is 


(o|vl)abs = 82y wx "mz? (5.104) 
with 
m 
gann © L, (5.105) 
fro 
In the radiation-dominated era, 
1/2 2 
81p 2n (ng.MV2 my 
H =|=] => . 5.106 
(=) 3 ( 5 ) x?mp ( ) 


Putting all this together, we get 


3/2. 1/242 
Fas(x) _ 38N (à my mp ELE 
H(x) 2x 8g. frqms 


104172 Ma 2 ; 
z {— a) oy -3/2,- 
N (2) (iz 103 x) x e (5.107) 


using (5.64). Above the quark-hadron phase transition, the processes (5.95) 
dominate and I'yy;/H scales as x, achieving its maximum value just after the 
transition. Thus, we can estimate the final relic abundance as 


Ya 
Ya (c0) = mi -= ( -= 2) 
Ya,eq 
2 
xexp| — 38N (E) my mp. I (xA) 
P 2n3 8g. fPQmz qh 

0.278 
8«,dec 


104172 ma 2 
x exp -(2) (zu) I(xqa) 


(5.108) 
where 


oo + 
I (xgh) = f x75 dx! 
Xgh 

2 - 
=- Slag "Orgs — 1) + 2/7 (erf( /Xqn) — 1)] (5.109) 


and g, aec is the value of g, at decoupling (freeze-out). The parameter xqa ~ 5 is 
the value of x at the quark-hadron phase transition, so that Z ~ 107^. Thus, if the 
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mass of the axion were greater than about 0.1 eV, the relic abundance would be 
near to the equilibrium abundance. In fact, masses this large are already excluded 
by the data, so the relic abundance depends upon the initial value. For example, 
with ma saturating the SN 1987A bound (5.94), we estimate that 


Y, Y,(0 
Ya(o9) 9,007 +. 0.99312" (5.110) 
a,eq Ya. 


At high temperatures, it is plausible to assume that there are no axions, so that 
Y,(0) = 0, in which case the relic abundance is very far from thermal. In any 
event, it is clear that thermal axions cannot provide anything like the measured 
matter density. As in (5.16), we define Qn to be the fraction of the closure energy 
density provided by thermai axions: 


oth = Pa (5.111) 
Pe 
Then, analogously to (5.35), we find that 
ma = = = Erde oth 2(130 eV). (5.112) 


Saturating the measured value 620 of the current mass density would require 
mg ~ 18 eV for closure, a value which is clearly excluded by the observational 
bounds already obtained. 

In all of the foregoing discussion, it was tacitly assumed that the classical 
axion field had a constant value, in fact the value (5.61) needed to ensure that 
the strong O-term vanishes. However, in the early universe, when the temperature 
T ~ fa > Agcp, the U(1)pg symmetry is broken and massless axions are 
created. The potential which gives the axions a mass arises from non-perturbative 
instanton effects only when the temperature drops to 7 ~ Agcp. Thus, at high 
temperatures, there is no reason why the axion should have the preferred value 
for which à. = 0. When instantons generate a potential for the axion field, it 
will roll towards the preferred value, so the foregoing assumption that the field 
is a constant is not true during this era. This ‘misalignment’ of the field with its 
ground-state value means that there is a non-zero axion field energy density which 
we shall now calculate. 

We assume that the axion field is spatially homogeneous and depends only 
on time. Then, from (5.63), the effective axion action is 


s= fats Jah à? — sma? +Taå) 
= [os R3(t)(4a? — 1m2a? + Pad) 


where R(t) is the cosmological scale factor and we have retained only the 
quadratic (mass) term in the axion potential. The equation of motion is 


SRG +T,)] + R3m2(T)a = 0. (5.113) 
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The decay width T, of the axion is tiny, so we may safely ignore it henceforth. 
Initially, at high temperatures T >> A gc p, the axion is massless and we assume 
that à = 0. Then the field is constant: 


a(t) — ai (5.114) 


where aj is the initial ‘misaligned’ value of the field. As the temperature falls 
m? (T) increases and the equation of motion is 


G+3Ha+m?2(T)a =0 (5.115) 


where, as usual, H = Å /R is the Hubble parameter. Eventually, the temperature 
reaches 7; at which 
ma(Ti) = 3H(T)) (5.116) 


and, thereafter, a(t) oscillates with frequency m,(T). The energy density 
associated with the axion field is 


Pa = 4a? + mia? (5.117) 
so, using (5.115), 
ba = ramsa? — 3Ha. (5.118) 
Averaging over one oscillation 
(a?) = m? (a?) (5.119) 
and then (5.118) gives 
. Ma 
(Pa) = (= - 3H) (0a) (5.120) 
Ma 
whose solution is 
(Pa) R? (t) x ma (T). (5.121) 


Thus, the axion number density ng = (05) / ma(T) scales as RÌ (t), even though 
the axion mass is varying. The entropy density s also scales in this way, so 
assuming that there has been no entropy production since the axion field began 
to oscillate, their ratio is conserved. When the temperature T = Tij, given by 
(5.116), 

pa = 3m2 (Tia; (5.122) 


since initially à = 0, and 


Ng 45ma(T;)a? 45a? 
=| =— Fe SS (5.123) 
5 im 41? g,T, 2 /5ng.Timp 


The present (misaligned) axion energy density is given as a fraction of the closure 
energy density by 
Ha $0 


mis 
Qmis.. Pad Ta) ma, (5.124) 
Pe $|r Pe 
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Now, recall that the axion field satisfies (5.61), so a; is given in terms of the initial 
value 0; of the angular field 0 by 


ai = fa; (5.125) 


where fa is given by (5.65), and 6; can be anywhere in the range (—7., 7). The 
temperature 7; found by solving (5.116) is about 1 GeV for an axion of mass 
ma = 1075 eV and 


Ti ma 0.18 
~(—2-) . 12 
1GeV (1s x) (5.126) 
Then » 
. 10 m -1.18.. 
amis ~ 9 16572 ( — —— ;. . 
a (2) (53 s) i (9.127) 


Note the negative power dependence on mg. If we replace 6; by its root mean 
square (rms) value 7/4/35 in the range (~x, 7), we see that the axion energy 
density from the vacuum misalignment does not exceed the measured matter 
density (5.22), provided that the mass of the axion is greater than about 107? eV. 

However, it is not clear that we are justified in replacing 0; by its rms value. 
In each causally connected domain, we expect 0; to have an independent value. 
If these values are randomly set, then it is reasonable to replace 6; by its rms 
value provided that the observable universe is composed of many such causally 
connected domains. We shall see later that there is good evidence that there was 
a period of inflation in the early universe and, if the reheating temperature after 
this is over is lower than that of the PQ symmetry breaking, then the observable 
universe is composed of only about one causal region and we have no a priori 
reason for selecting any particular value of 0; in our patch and, consequently, no 
way of estimating Q5, [n any case, (5.127) is only a rough estimate. There are 
theoretical uncertainties, which amount to a factor A ~ 1-3, deriving from the 
PQ-model dependence and the nature of the QCD phase transition and also from 
anharmonic corrections which give a factor f ( 8i) when the initial value 0; is in 
a region where other terms in the axion potential are important, besides just the 
quadratic terms which we have retained. 

Further, the homogeneous oscillations of the axion field correspond to 
the creation of zero momentum axions and it has been argued that non-zero 
momentum axions, with a momentum spectrum g(k), are created before the 
temperature drops to T ^ Agcp by other non-perturbative effects. In 
consequence, the axion density deriving from the ‘misalignment’ effect is an 
underestimate of the actual density, as we shall see. The U(l)pg symmetry 
with which we are concerned is directly analogous to the global U (1) symmetry 
relevant to a superfluid ^He condensate at low temperatures. In this system, it is 
known that besides the ordinary bulk superfluidity, analogous to our homogeneous 
axion field, there are also vortex configurations in which the phase of the order 
parameter (or the pair wavefunction) varies spatially, although its magnitude 
remains constant (determined by the density of the superfluid condensate). Such 
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topological configurations arise because the fundamental group of the manifold 
S! associated with symmetry group U (1) of the ground state is non-trivial: 


m(S!) =Z. (5.128) 


As we traverse any closed path threaded by a vortex, the phase of the condensate 
varies continuously and changes by an integral multiple of 27 when we return to 
the starting point. By shrinking the size of the closed path, it is clear that there 
is a linear vortex on which the phase of the order parameter is undefined. In a 
real superfluid, there is a cylindrical core region, centred on this line, in which the 
magnitude of the order parameter varies, approaching zero on the line. 

Similar considerations apply to our U(1) pg symmetry. In the early universe 
when the PQ symmetry is broken, we expect the axion field to vary spatially, since 
it is uncorrelated beyond the horizon. These topological considerations (5.128) 
indicate that a random ‘axion string’ network will, therefore, be formed [33] just 
as vortex configurations are formed in superfluid “He. The thickness of the core 
region is à ~ f; '. Roughly, there are two types of string: long strings, spanning 
the horizon, and small string loops. The loops oscillate and radiate axions, and this 
is the dominant energy-loss mechanism [11,34]. The axions are massless when 
they are emitted and the emission continues until they acquire a mass via instanton 
effects when the temperature drops to T ~ Agcp. A numerical simulation of a 
random network of (global) axion strings has been performed recently [35, 36]. 
This shows that, after a short initial period of relaxation, the network evolves to 
a 'scaling' regime, in which the large-scale behaviour of the network scales with 
the Hubble radius and the energy density is given by 


ping = bu (5.129) 


where £ is a constant and y is the string tension per unit length. Such behaviour 
was predicted theoretically by Albrecht and Turok [37].The radiated axions have 
a momentum spectrum g(k) which is peaked around wavelengths of order of 
the horizon scale (k-! ~ (4t H ) and which decays exponentially for shorter 
wavelengths. The contribution Qn £ to the current fractional relic axion energy 
density is calculated as follows: 


Mma \-118 
10-5 s) 


which is somewhat larger than, but comparable with, the value obtained from 
the misalignment mechanism if we take the rms value for 6;. So applying the 
measured matter density bound (5.22) requires the axion mass to be greater than 
about 107? eV, as before. The numerical simulation was performed on a 256? 
lattice but it has been noted [38] that this might not be sufficient to observe 
logarithmic corrections, proportional to t7? In t, to the scaling behaviour (5.129). 
Such corrections would have the effect of enhancing axion production at later 


gine ~ (0.39 + 0.26)h~? ( (5.130) 
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times, thereby reducing the lower bound on the axion mass needed to avoid 
overclosure. Of course, if the reheating temperature after inflation is lower than 
that of the PQ symmetry breaking, then axion strings and the radiated axions 
are washed out and essentially all of the relic axions come from the coherent 
oscillations of the zero mode discussed earlier. 

Axions produced at a temperature around T ~ Agcp when they acquire a 
non-zero mass are non-relativistic and they are, therefore, candidates for cold dark 
matter (CDM). Studies of large-scale structure formation indicate that CDM is an 
important component and, if so, it gets trapped in the gravitational potential and 
contributes to the galactic halo. Such axions might be detected experimentally 
using a cavity permeated by a strong static magnetic field [39] via the Primakoff 
process (5.90). When the cavity frequency is tuned to the axion mass, the galactic 
halo axions convert resonantly into photons of that frequency. A cylindrical cavity 
of radius ] m has a lowest TM mode of frequency f — 115 MHz, corresponding 
to an axion of mass mg = 0.475 x 107 eV. Experiments using a tunable cavity 
of this size have produced exclusion zones in the axion mass versus axion-photon 
coupling (ma, gayy) plane. All of these are normalized assuming that the local 
halo (CDM) density is entirely in the form of axions. The conversion power of the 
resonant cavity is proportional to 82yy and, until recently, the power sensitivity 
levels were too high to bound theoretically favoured models. However, a recent 
experiment at LLNL [40] has excluded KSVZ axions in the range 


2.77 x 10 8 eV < ma < 3.3 x 10 eV (excluded) (5.131) 


and further experiments are underway at LLNL and Kyoto with sufficient 
sensitivity to detect DFSZ axions at even a fraction of the local halo density. 


5.4 Exercises 


l. Verify the form (5.63) giving the interactions of the axion with the gauge 
fields and fermions. 

2. Show that axion decay into two photons is described by the Lagrangian 
(5.66) with g, given by (5.69). 

3. Show that axion decay into an electron-positron pair is proportional to g2,, 
where gaee is given by (5.87). 

4. Verify the expression (5.108) for the final axion relic abundance. 


5.5 General references 


The books and review articles that we have found most useful in preparing this 
chapter are: 


e  Peccei R D 1989 The Strong CP Problem in CP Violation ed C Jarlskog 
(Singapore: World Scientific) 
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Kolb E R and Turner M S 1990 The Early Universe (Reading, MA: Addison- 
Wesley) 

Muryama H, Raffelt G, Hagmann C, van Bibber K and Rosenberg L J 2000 
Axions and other very light bosons (Review of Particle Properties Groom D 
E et al Particle Data Group) Eur. Phys. J.C 151 
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Chapter 6 


Supersymmetric dark matter 


6.1 Introduction 


The cosmological bounds on the masses of various known or hypothethical relic 
particles derive from the requirement that the total energy density of the relic 
particles X does not exceed the measured present total energy density pp. In 
terms of dimensionless quantities, this gives 


$2x,9 < Ro (6.1) 


where Qy 9 and Qo are defined in (5.16) and (1.41) with Qo given by (5.19). 
Some of the relics are known. For example, the present photon energy density 


py = ETA, with To = 2.73 K = 2.35 x 1074 eV the present temperature of the 
cosmic microwave background, gives Qyh? ~ 2.471 x 1075. Thus, 


Qy = 5.1 x 1075 (6.2) 


taking h = 0.7148% as in (5.14). Similarly, and as noted previously, the 
measured primordial deuterium and helium abundances require that the baryon 
energy density gives Q5? = 0.019 + 0.003. Thus, 


Qp = 0.039 + 0.004 (6.3) 
and baryons constitute not more than a few percent of the total. Relic neutrinos 
also contribute and we may invert (5.35) to obtain a contribution of 


~ 47.4eV 
for each relativistic species with m, >> To. The current experimental bound for 


the electronic species from the Mainz and Troitsk tritium beta-decay experiment 
[1) is far more restrictive: 


(6.4) 


vp 


my, « 2.2 eV at 95% CL. (6.5) 
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Further, this bound effectively applies to all neutrino species, since the SuperK 
[2] and SNO [3] data show that the mass differences for atmospheric and solar 
neutrinos are tiny: 


Am? amos 7:3 x 1073 eV? (6.6) 
Am? solar 7: 7 x 1075 eV?. (6.7) 


Thus the sum of the neutrino masses is at most, 6.6 eV and likely to be much 
smaller. (When my < To, the relativistic fermion density gives 2, = 0.232,.) 
In addition, simulations of structure formation in a neutrino-dominated universe 
are unable to reproduce the observed structure. The best fit to all of the 
cosmological data [4] gives 

Q, < 0.015 (6.8) 


which for three degenerate species implies 


my, < 0.23 eV at 95% CL. (6.9) 


e 


It is, therefore, clear that relic neutrinos too constitute only a small fraction of the 
total energy density. We might have anticipated that the dominant contribution to 
the present energy density would come from the matter comprising the galaxies 
and the contribution of the baryons is the largest. However, it is nowhere near 
large enough to account for the total energy density. Thus, we may be confident 
that the major contributions to Qo are not from known sources. In any case, as we 
shall shortly see, there is strong evidence that there is a large amount of invisible 
‘dark matter’ in the universe [5]. 

The possibility of dark matter was first suggested by Kapteyn [6] in 1922, 
who noted that its mass could be estimated from the velocity distribution of stars 
in our galaxy. The strongest evidence for its existence comes from measurements 
of rotation speeds of spiral galaxies. If we consider a star moving with speed v(r) 
in a circular orbit of radius r outside of a spherically symmetric mass distribution 
with a total mass M (r) interior to r, then 


GnM(r) =rv(r)?. (6.10) 


The speed v(r) can be determined for luminous objects such as stars or gas clouds 
by measuring the Doppler shifts in emission or absorption lines and the mass 
distribution M (r) is then inferred from (6.10). The mass of a spiral galaxy can be 
determined by taking r to be the radius within which most of the light is emitted. 
In this way, the average galactic mass (mga) can be calculated. Combining this 
with the measured number density nga determines the average energy density 


(Plum) = Mgal (Mgal). (6.11) 
These measurements show that the contribution of luminous matter is 


Qium < 0.01. (6.12) 
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In other words, luminous matter accounts for less than 1% of the mass of the 
universe. When these techniques are applied to the rare stars and neutral hydrogen 
(HI) clouds beyond the radius where light from the galaxy is emitted, it is 
found that M(r) continues to increase, reaching a maximum in the range 150— 
300 km s^! within a few kpc, and then remaining constant out to the largest radii 
at which HI clouds can be found. If there were no matter outside of the luminous 
region, then from (6.10) v(r) should fall off as r~'/2. Thus, the measurement of 
roughly constant values of v(r) in over 1000 galaxies indicates that the galaxies 
have huge ‘halos’ of dark matter, with mass 3—10 times that of the luminous 
component. The rotation curve for our own Milky Way galaxy is difficult to 
measure, because the observer is inside the galaxy but there is little doubt that our 
galaxy too is immersed in a dark matter halo. Further, by studying the motion of 
galactic clusters a universal mass density corresponding to [7] 


Rmh? ~ 0.1-0.3 (6.13) 


can be inferred. In fact, it was measurements of cluster galaxy dynamics that led 
to the discovery of dark matter by Zwicky [8] in 1933. Recent data on the acoustic 
peaks in the cosmic microwave background (CMB), combined with independent 
data from simulations of cluster formation, high-z supernovae, quasars, and the 
Lyman alpha forest, give the best-fit values [4] 


Qo = 1.02 +0.02 QA? = 0.13570 008 (6.14) 
Q4 =0.65 +0.05 24h? = 0.0224 + 0.0009 (6.15) 


where Qm = Pm/fc is the total matter contribution, distinguished from the 
cosmological constant contribution 24 = Pyvac/Pc = A /Hà, and pc = 3M2 H 
is the critical density defined in (1.37), and k = 0.7149-08. Clearly, Qn 4 Qp. 
So there must be non-baryonic dark matter and the first problem is to identify 
its nature. There is also a second problem, which is to explain the discrepancy 
between the observed luminous matter density given in equation (6.12) and 
the calculated baryon density (6.3) required for the success of the primordial 
nucleosynthesis calculation. We shall have little to say about the latter problem, 
save to note that it seems at least possible that it can be solved by a combination 
of dark stars, intracluster gas and the Lyman alpha forest [9]. 

In this chapter, we first characterize the general properties that dark matter 
particles possess, whatever they are. Since there are no satifactory candidates 
within the standard SU(3) x SU(2) x U(1) theory of strong and electroweak 
interactions, it is natural to look for suitable candidates in the (minimal) 
supersymmetric version of the standard model, the MSSM. One possibility, that 
gravitinos make up the dark matter, arises in any locally supersymmetric theory. 
It is studied in section 6.3. However, the most popular view is that dark matter is 
made of neutralinos. The parameters of the MSSM that control the mass and other 
properties of the neutralino are detailed in section 6.4. In the following section, 
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we discuss the bounds that can be put on the neutralino mass using cosmological 
data and also the constraints on the parameters of the MSSM that arise from other 
data whose theoretical prediction depends upon these parameters. We discuss the 
prospects for the experimental detection of (neutralino) dark matter in section 6.6. 


6.2 Weakly interacting massive particles or WIMPs 


To have survived until the present epoch, any (non-baryonic) dark matter particles 
must either be stable or have a lifetime comparable with the present age of the 
universe. Further, if the dark matter particles have electromagnetic or strong 
interactions, they would bind to nucleons and form anomalous heavy isotopes. 
Such isotopes have been sought but not found [10]. Thus, the dark matter 
particles can, at best, participate in weak (and gravitational) interactions or, at 
worst, only in gravitational interactions. One obvious possibility satisfying the 
foregoing constraints is that the dark matter consists of neutrinos. However, 
we have already noted that the present data on neutrino masses (from tritium 
decay, atmospheric and solar neutrino experiments) show that although neutrinos 
might barely account for the inferred mass density (5.22) or (6.14), simulations 
of galaxy formation and cluster formation require cold dark matter. That is, the 
dark matter is made of weakly interacting massive particles (WIMPs). No such 
particles exist in the standard model but they do in its enlargement to the minimal 
supersymmetric standard model (MSSM). 

We can estimate the relic density of WIMPs using the same techniques as 
those used for relic neutrinos in section 5.2. The difference is that the equilibrium 
abundance for a cold (i.e. non-relativistic) fermion species X is obtained from 
(5.1) by taking the limit T « my. The result is 


T 3/2 
NX,eq = 8X (=) e"x/T. (6.16) 


Roughly speaking, freeze-out of such relics occurs when their annihilation rate 
Ta becomes equal to the Hubble rate H. The annihilation rate is given by 


F4 = nx,eg(cAlvl) (6.17) 


where c4 is the annihilation cross section, v is the relative velocity of the 
annihilating WIMPs, and (...) denotes an averaging over a thermal distribution 
of velocities of each particle at the decoupling (freeze-out) temperature Tgec. The 


Hubble rate is ; 
8&1G T 

H =f EON? Lee LT —. (6.18) 
3 mp 


The abundance Yx Tẹ. of X-particles at freeze-out is, therefore, given by 


NX eq, Tec Haec 
Yx ty = ——— -———— (6.19) 
w Sdec Sdec (0a |V|) 
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where 2 
2x 
Sdec = "aS ES Tae Tiec (6.20) 


is the entropy density at freeze-out, with e, s, r defined in (5.10) As before, this 
gives the current abundance Y x,o. Equating (6.17) and (6.18) then yields 


PXx,0 $0 
= Yx,o—mx 
Cc c 
H. 
= —7X_ 50 Mec (6.21) 
(cAlvl) Pc Sdec 


mx 1 (? x oa) 


2x0 = 


= — 6.22 
AT (6.22) 


i Tiec Af 84. Tiec 


The proportionality of Qy,9 to the inverse of (o,4|v{) means that the relic 
abundance is reduced as c4 increases and this might have been anticipated: the 
more efficiently annihilation proceeds, the fewer relics remain. Taking gy = 2, 
the freeze-out temperature satisfies 


-1/2 
(=) e"x/Te« — 0.076(oA|vl) ZZE = K (6.23) 
Téec 8, Tac 


which may be solved iteratively 
TX ~ink + 4inink. (6.24) 
Táec 


For a typical value g,, 7, = 60 and a typical weak cross section 


2 100 GeV V? 
(cAlvl) = canon -c (mE) 2.5 x 1077 cm? s^! (6.25) 
m, mx 
where c is of order unity, this gives 
mx mx 
Thus, ; 
6 x 10727 cms! 
Qy ge án (6.27) 
(cAlvl) 
and, using (6.22), a typical weak cross section (6.25) gives 
2.3 mx 2 
£&xo-——l——-—— . 
ERE" (ioo Gv) (6.28) 


remarkably close to (6.14) for mx = 100 GeV since k? ~ 1. 
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The foregoing analysis shows that the dark matter might well be WIMPs but 
there remains the question of what the WIMPs actually are. Since there are no 
satisfactory candidates within the standard model, we must investigate plausible 
extensions of it. The most favoured, which has been studied in great detail in 
recent years, is the MSSM. (See, for example, [11]). We shall say more about the 
MSSM in section 6.4. First we discuss the possibility that dark matter is made of 
gravitinos, particles that occur in any locally supersymmetric theory. 


6.3 The gravitino problem 


In a supersymmetric theory, all particles have an associated superpartner, called a 
‘sparticle’, whose spin differs by j from that of the original particle. In a (locally 
supersymmetric) supergravity theory, the sparticle associated with the (spin-2) 
graviton has spin j and is called the 'gravitino'. When supersymmetry is broken, 
the gravitino acquires a non-zero mass (see chapter 5 of [11], for example) 


man = eO0/2, p (6.29) 


where Gy is the expectation value of G in the physical vacuum, with G defined in 
(2.144), and mp — G, = 1.22 x 10!9 GeV is the Planck mass. If gravitinos 
have survived until the present epoch, then their energy density p3/2,0 could, in 
principle, dominate but not exceed [12] the present total energy density po of the 
universe. Thus, 


23/2 < Qo (6.30) 
where Qo is defined in (1.41) and 
Q,,2 B22. (6.31) 
pc 


As for neutrinos, we can use this to bound m3,2. Since gravitinos interact only 
gravitationally, their interaction rate 


T5 
T3/2,int ~ Gi, Ti ~ =z (6.32) 
mp 


just on dimensional grounds. They decouple when 


T? 
D3/2,im = H ^ —- (6.33) 
mp 


which occurs when T ~ m p and while they are still relativistic. After decoupling, 
the number of gravitinos per comoving volume is constant so, as in (5.24), 


Y3/2,dec = Y3/2,0. (6.34) 
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87 (3) g3/2ett8 S, To 


GN To maja (6.35) 
31 BaS, Tas ° 


QH = 


where g3/2,eff = 1 x 4, since the gravitino has spin-3, Putting all of this together 
gives 
m3/2 = 23/2h* g45,174,.(4.4 €V) € 1 keV (6.36) 


taking g«s,r,, ~ 200 and using the value (1.42) for $29. This is far smaller 
than the <10 TeV scale needed to protect the hierarchy and to give TeV-scale 
masses to the sparticle spectrum, remembering that m3/2 controls the size of the 
supersymmetry-breaking masses for matter. 

A more likely scenario is that gravitinos are heavier but decayed before the 
present epoch. The fastest decay mode of the gravitino is into a standard model 
particle and its supersymmetric partner, with rate 

m3 
P3/2,max  — 1. (6.37) 
mp 


When the temperature T ^ m3/2, the expansion rate H ~ m2 ,/mp, which 
is faster by a factor mp/m3/2. This means that the equilibrium condition 
F > H can only be reached at temperatures far below m3;2 and, at these 
temperatures, collision processes are too weak to produce gravitinos. So the 
gravitino population can only be reduced by decays. Until they decay, the cosmic 
energy density is dominated by gravitinos with energy density 


_ 363) eT) as 


——— m 6.38 
Pei guo (639) 


where g(T) is the effective number of massless degrees of freedom at temperature 


T. The expansion rate 
1/2 
8x P 
H = [8% Ps (6.39) 
3 mp 


becomes equal to the decay rate [3/2 when T ~ 73/2, where 


, 1/3 [T2 m2, V7? 
Typ ms) 3m \ (6.40) 
8¢(3)9(T3/2) m3/2 


When they decay the energy is thermalized and reheats the universe to a 
temperature 


Tj» (6.41) 


_ (90: (3) 4 [maT 
EE. 8(T3/2) 


Minimal supersymmetric standard model (MSSM) 179 


Using (6.40), this gives 


452(T, 1/4 
T3 = (oin) /T3y2m p. (6.42) 


41?8g?(T3/2) 
If we use the fastest decay rate (6.37), we find that T; n< To ~ 3K if 
m3/2 € 10 MeV. The increase in temperature after reheating 


T; m 1/6 
Az > 454 2r (3)! 3g 73/2 (Ty )g VY TS) (25) (6.43) 
3/2 msn 


results in a large increase in the entropy density, by a factor (m p/m3 uu ? since 
s « T?. So if the gravitinos decay after nucleosynthesis, the baryon number to 
entropy ratio n g/s during nucleosynthesis must have been much higher than now. 
This leads to too much helium and not enough deuterium [13]. To avoid these 
problems, we require that T 2 > 0.4 MeV, so that any previously formed helium 
nuclei are broken, the neutron-to-proton ratio is restored to its equilibrium value 
and nucleosynthesis restarts. If we use the fastest decay rate (6.37) in (6.42), this 
gives 

m3/2 Z 10 TeV (6.44) 


which might, just, be consistent with solving the hierarchy problem. Whether 
or not the maximal decay rate (6.37), associated with the decay to a particle and 
sparticle, actually occurs depends on the details of the supergravity model used. 
For example, if the gravitino is the lightest supersymmetric particle (LSP), then 
if it decays it can only decay to non-supersymmetric particles, with a decay rate 
less the maximal. This decreases both 73/2 and T; / thereby exacerbating the 
problem. 


6.4 Minimal supersymmetric standard model (MSSM) 


Although it is not required theoretically, it is customary to impose an 'R-parity' 
invariance on the supersymmetric standard model to ensure the absence of fast 
proton decay. The quantity R is defined as R = (—1)°(8-4)+25, where B is 
the baryon number, L is the lepton number and S is the spin. It is assumed to 
be multiplicatively conserved in all interactions. Then the most general MSSM 
has a total of 124 independent parameters [14]. These are comprised of three 
gauge coupling constants (21,2,3), three gaugino masses (M1,2,3) and two gaugino 
phases, four Higgs/Higgsino sector mass parameters (m2, , mA, B, p) and one 
phase, nine fermion masses, 21 scalar squark and slepton masses, 39 mixing 
angles and 41 phases and 8 (the QCD 6-parameter discussed in section 5.3.1). 
This model is sometimes referred to as MSSM-124 [15]. In contrast, the 
standard model has *only' 19 parameters: the three coupling constants, two Higgs 
parameters (m, v), nine fermion masses, three mixing angles and one phase, 
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and 0. Since there are two Higgs doublet chiral superfields H, and H4 in the 
MSSM, there are two VEVs, v, and vg. The combination v2 + và is fixed by 
the measured value of mz and is, therefore, not a free parameter but the ratio 
tan B = v,/w is a free parameter. The MSSM has five physical Higgs particles 
of which two (H *) are charged, two (h°, H?) are neutral scalars and one (AU) 
is a neutral pseudoscalar. The masses are all fixed in terms of the (known) gauge 
coupling strengths and three parameters, two of which (u, tan B) have already 
been defined. Without loss of generality, the third may be taken to be m4. Apart 
from @ and the CKM mixing angles and phase, the unknown parameters of the 
MSSM therefore consist of 63 masses and mixing angles and 43 phases. Even 
if all of the phases are set to zero, it is still not feasible to explore the remaining 
parameter space (M1,2,3, 44, tàn B, ma, 21 scalar squark and slepton masses and 
36 mixing angles). 

The number of parameters is, therefore, drastically reduced by making 
further assumptions. In the low-energy approach [16], special phenomenolgically 
viable points in the parameter space are selected. For example, the five scalar 
(squark and slepton) symmetric 3 x 3 mass matrices and the three trilinear 
coupling Á-matrices might be assumed to be generation independent or that they 
are flavour-diagonal in a basis where the quark and lepton mass matrices are 
diagonal. Neither of these has any strong theoretical motivation. Alternatively, 
in the high-energy approach that we shall follow, the parameters of the MSSM 
are treated as running parameters. In other words, the parameters 'run' or evolve 
with the renormalization scale in a way determined by the renormalization group 
equations. Then a structure is imposed on the parameters at some high energy 
scale. This would be the case if there is an underlying GUT symmetry, for 
example. In such a model, it is assumed that the gauginos all have a common 
mass 73/2 at some (a priori unknown) unification scale m y: 


Mi(mx) = M2(mx) = M3(mx) = my. (6.45) 


The gaugino masses at the electroweak scale are determined using renormaliza- 
tion group equations and at the electroweak scale their ratios are determined by 
the gauge coupling strengths: 

M3 a3 Mı 501 


= _— 6.46 
Mrz œ M; 3a2 ( ) 
where a3 = g2/4n etc. Similarly, it is also assumed that all scalars, except 


possibly the Higgs soft masses squared m» have a common squared-mass m 


and that the trilinear cofficients have a common value A, at the unification scale. 


m^, (mx) = mg; (mx) = my (mx) = mil; (6.47) 
mi (mx) = mz (mx) = mols (6.48) 


Ay = Aq = Åe = Als. (6.49) 
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This happens (in supergravity models) if the origin of the supersymmetry 
breaking is a ‘hidden’ sector which shares only gravitational interactions with 
the ‘observable’ sector that we inhabit. (The unification scale, as well as the 
value of the unified gauge coupling strength, is determined from the measured 
low-energy values of the coupling constants using the renormalization group 
equations.) An advantage of this approach is that one of the diagonal Higgs 
mass-squared parameters is typically driven negative by the renormalization 
group running, so electroweak symmetry breaking is thereby generated radiatively 
and the scale at which this happens is intimately connected to the low-energy 
supersymmetry breaking. The minimal supergravity (mSUGRA) model starts 
with seven parameters (if we allow for a non-minimal Kahler potential), namely 
mi, m3, u(mx), A, B, mi, me. Using the renormalization group equations, 
these determine mz, tan B (and m2): it is customary to choose tan f as an input 
parameter and mz is, of course, fixed. Then || and B are outputs and the 
remaining unknowns are m?, m2, €(u(mx)), tan B, mj/2, mo. When m? = m3 = 
mj, the model is called the ‘constrained’ MSSM or CMSSM. 


Since R = +1 for all particles in the standard model and R = — 1 for all 
of their supersymmetric partners (sparticles), it is easy to see that the lightest 
supersymmetric particle (LSP) must be stable. To be a WIMP candidate, the LSP 
must also be a colour-singlet and electrically neutral and there are relatively few 
sparticles with these properties. One posiibility is a sneutrino v. However, this 
possibility has been excluded. An accelerator-based limit from the ‘invisible’ 
width of the Z boson requires m; 2 44.7 GeV [17] but, in this case, direct relic 
searches in low-background experiments require m; > 20 TeV [18]. Another 
possibility that arises in supergravity models is the gravitino, which is essentially 
undetectable. Also, as we saw in section 6.3, gravitino dark matter might raise 
theoretical problems that supersymmetry is supposed to have solved. However, in 
most supergravity models, the gravitino is not the LSP and is unstable. The most 
popular candidate by far is that the LSP is a neutralino [19]. 


6.5 Neutralino dark matter 


There are four neutralinos x9(n = 1,2,3,4) in the MSSM, each of which is a 
linear combination of the four & — —1 Majorana fermions: the Wino W3, the 
partner of the SU(2);, gauge boson; the Bino B, partner of the U(1)y gauge 
boson; and the two neutral Higgsinos A, and Hg. Thus, 


xl = Ni B + Non W? + Ns, Hi, + Nan Ha — (n=1,2,3,4) (6.50) 
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and the coefficients Nj,(i = 1,2,3,4) are the normalized eigenvectors of the 
neutralino mass matrix 


Mı 0 —mzcgsw  mzspsw 
M. = 0 M2 mzcgcw | —mzsgcw (6.51) 
x —mzcgsw  mzcpgcw 0 -pu ` 
mzsgsw | —mzsgcw -pu 0 


where cg = cosf, sg = sinf, cw = cos@w and sw = sinOw. The mass 
eigenvalues are conventionally labelled in ascending order, so xe is the lightest 
neutralino and x? the heaviest. Besides the (known) parameters mz and Ow that 
appear in the standard model, the neutralino mass matrix involves four of the 
further 63 parameters that specify the (fairly) general MSSM discussed earlier. 
These are M1,2, the (soft) masses of the U(1)y and SU(2), gauginos, and 
the Higgsino mixing parameter u. In mSUGRA models, (6.46) holds and the 
neutralino masses and mixing angles are determined by only three parameters. 

The question to be addressed then is whether for certain values of these 
limited parameter sets the MSSM has the neutralino x? as the LSP and, if so, 
whether the predicted relic density is consistent with the observational data (6.14). 
To answer the latter question, the cross section for neutralino annihilation must 
be calculated in the MSSM and then used to calculate the relic density which 
is compared with the observational data on cold dark matter. Subtracting the 
baryonic contribution $25 from the total matter contribution $2, in (6.14) gives 
the 2c range for the cold dark matter density satisfying 


0.094 < Qcpmh? < 0.129. (6.52) 


Before discussing these calculations, we should note that a precise determination 
of the relic density requires the solution of the Boltzmann equation governing the 
evolution of the number density ny. The estimate (6.27) is a fairly good estimate 
when c4|v| is approximately constant, independent of v. Since the neutralinos 
are non-relativistic, we may generally expand the annihilation cross section as 


oalvol 2 a t bv? +. (6.53) 


where the a term receives contributions only from s-wave scattering, the b-term 
from s- and p-waves, and so on. If a >> b, then o,|v| is indeed approximately 
constant. However, as we shall see, this is often a poor approximation because 
the dominant annihilation channel has the s-wave suppressed because of CP- 
invariance considerations. Thus, the p-wave is dominant and the estimate (6.27) of 
the relic abundance is a poor approximation. The true abundance can be computed 
by a numerical integration of the Boltzmann equation but an improved analytical 
approximation can also be found by solving in both the early- and late-time limits 
and then matching the two solutions near freeze-out. The result is [20] 


$0 
Q0 = Yy,o—my (6.54) 
Pe 
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with the present neutralino abundance Y, 9 given by 


.78 1 
nos —— 5 (zz ) o (6.55) 
MBs T m pmy \ Tiec) a +3(b — a/A)Taec/ my 


and the freeze-out temperature satisfying 


Lt = In[0.0764m p(a + 6bTaec/ m , )c(2 + c)m y (8s, Tagg! | Tac) ^^] (6.56) 
which can be solved iteratively; c is a numerical constant of order unity 
determining when the early- and late-time solutions are matched: c ~ j typically 
gives a 5-10% precision. In special circumstances this estimate can be wrong 
by factors of two or more. These are (i) when the annihilation occurs near an 
S-channel pole; (ii) when the annihilation occurs near a mass threshold; and (iii) 
when there is 'co-annihilation', i.e. when there is another particle (x^) (e.g. a 
squark) with a mass that only slightly exceeds m,, and the x can be converted to 
a x’ via scattering from standard model particles. If the annihilation cross section 
for the x's is larger than that of the xs, then the abundance of both is controlled 
by the annihilation of the heavier and more strongly interacting particle. These 
special cases are important in practice and allow certain regions of the MSSM 
parameter space that would otherwise be forbidden. 

The coefficients a and b in (6.53) are bounded above by partial-wave 
unitarity arguments, with the bounds being of order my, essentially on 
dimensional grounds. This leads to a model-independent lower bound on the 
relic abundance [21] ; 

my 

2x02 (orv i 
Using the upper bound in (5.22) then gives my < 100 TeV. Of course, 
in the MSSM models with which we are concerned, the cross sections are 
proportional to o2,, so the largest cosmologically acceptable WIMP mass will 
be reduced by a factor of dem ~ 107? from this most conservative bound. 
Thus, in supersymmetric models, we expect m, S ! TeV to be required by the 
cosmological constraint. 

The calculation of the annihilation cross section in the MSSM is 
straightforward in principle but quite complicated in practice and we shall only 
comment on the salient features. The most important channels for neutralino 
annihilation are those that appear in lowest order (tree-level) perturbation theory, 
see figure 6.1. These are annihilation into a pair of fermions 


x») ff (f-aqLv» (6.58) 


and into a pair of bosons 


(6.57) 


xox? + w*w-, Z929, w*n*, 2949, z9u9, 2949, HHHO 
(6.59) 
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Figure 6.1. Feyman diagrams for neutralino annihilation processes: x, is any of the 
neutralino or (in the t-channel) chargino states; h is the lightest (neutral) Higgs scalar, 
H is the other neutral Higgs or charged Higgs; A is the pseudoscalar; f is a fermion and f 
is the corresponding sfermion. (Not all processes are allowed.) 
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and all three pairs from A, H°, h°. 

Analytic expressions for the a terms for all of these processes are compact 
but the b terms are very involved (22]. Neutralinos are Majorana fermions and 
are, therefore, their own antiparticles. This means that, in an s-wave state, the two 
neutralinos must have their spins oppositely directed because of Fermi statistics. 
Therefore, if the neutralinos annihilate to a fermion ( f)-antifermion pair ( f» 
then f and f must also have their spins antiparallel, and this implies that the 
amplitude acquires a factor my to account for the helicity flip. Another way 
to see this is to note that the initial s-wave x? x? state has CP = —1, so CP- 
invariance requires that the final state also has CP = —1. Annihilation into light 
fermion pairs will always be kinematically allowed but the previous argument 
shows that the s-wave contribution to, and therefore the a term in, the annihilation 
cross section is proportional to m2 /m?. Thus, annihilation into light quarks and 
leptons is negligible compared with annihilation into c, b and t-quark pairs: the 
latter occurs only when m, > m, and dominates all other channels when it is 
open. CP-invariance also affects other amplitudes. For example, annihilation 
into Higgs bosons can be important when such channels are open. However, the 
s-wave amplitudes for the final states h°h°, H0 H0, H°n®, A040. HHH” are 
identically zero because of CP-invariance: the same is true of the Z°A° final 
state. 

The allowed parameter space is restricted by other data [23] besides the 
cosmological bounds (5.22). Specifically, the LEP bound on mp, and b — sy 
data, both force the parameter mi;2 to larger values, while the BNL E821 
measurement of the g — 2 factor of the muon [24] favours relatively low values of 
mo and m;/2, at least for u > O—the actual bounds are dependent on tan B. There 
is also the requirement that the neutralino is the LSP and that the parameters allow 
radiatively driven electroweak symmetry breaking. The current position seems to 
be [25] that the MSSM can simultaneously satisfy all of these constraints. [n the 
most constrained model, the CMSSM, there is a ‘bulk’ region in the (1/2, mo) 
plane with relatively low values of mo and m1/2 in which both the cosmological 
and the non-cosmological constraints are satisfied, see figure 6.2 taken from [25]. 
In this region, supersymmetry is relatively easy to detect at colliders. The 
constraints deriving from the precision WMAP data have substantially reduced 
the size of this region. The bulk region is essentially defined by using the 
expression (6.54) and, in this region, the neutralino is essentially the Bino (B), 
i.e. Nio >> N20, N30, Nao in (6.50). In this case the annihilation proceeds mainly 
via t-channel sfermion exchange. 

Extending from the bulk region to larger values of m/2 is a co-annihilation 
*tail', where the neutralino LSP is almost degenerate with the next-to-lightest 
sparticle, usually the stau 7. At larger values of mo, close to the region where 
radiative electroweak symmetry breaking is no longer possible, there is a 'focus- 
point’ region in which the neutralino has a larger Higgsino component, i.e. N30 or 
Nao in (6.50) are non-negligible. Lastly, when both mo and m2 are large there 
may be a ‘funnel’ where rapid direct-channel annihilations via the A and H Higgs 


186 Supersymmetric dark matter 


1000 


800 
> 
2 600 
e 
E w 

200 

0 
S 
ad 
e 
E 


9 10 20 30 40 


tan D 
(b) 


Figure 6.2. The ranges of m y allowed by cosmology and other constraints, for (a) u > 0 
and (b) 4 < 0. Upper limits without (upper solid line) and with (dashed line) the g, — 2 
constraint are shown for 44 > 0: the lower limits are shown as lower solid lines. Note the 
sharp increases in the upper limits for tan 8. 2 50, u > 0 and tan B > 35, u < 0 due to 
the rapid-annihilation funnels. Also shown as dotted lines are the ë; and x* masses at the 


tips of the co-annihilation tails. 
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boson poles occurs. For u > 0, the neutralino mass is constrained to satisfy 
108 GeV € m, € 370 GeV (6.60) 


with the minimum value occurring at tan B. = 23. For uw < O, there is no 
compatibility with the g — 2 data when the LEP data are used Thus, u > 0 is 
clearly favoured but if the g — 2 data are excluded the corresponding bounds are 


160 GeV € m, € 430 GeV. (6.61) 


It is beyond our scope to discuss the status of the CMSSM in any further detail 
and the interested reader is referred to [25, 26]. The main point is that the MSSM 
and even the CMSSM are consistent with all current data and have a neutralino 
with mass in the range (6.60). In the absence of new theoretical motivation for 
particular values of the parameters, the most urgent need is for more experimental 
data. One way to obtain this is to detect neutralino dark matter and to ascertain its 
properties. 


6.6 Detection of dark matter 


The most direct signal for neutralino dark matter would be to observe its scattering 
from nuclei in a detector. By fitting both the luminous and dark matter to the 
measured rotation curve in our galaxy, the dark matter density at the position of 
the solar system is found to be of order 0.3-0.7 GeV cm^?. If the halo of the 
Milky Way consists of WIMPs, then this means that hundreds to thousands of 
them pass through every square centimetre each second. For a typical neutralino 
with my ~ 100 GeV scattering from a xenon nucleus with mx, ~ 130 GeV, 
with a typical WIMP speed i ~ 270 km s^, the nuclear recoil energy is below 
100 keV (exercise 3). This energy is transferred to atomic electrons and produces 
detectable ionization. With a typical MSSM cross section, assuming coherent 
interaction with the xenon nucleus, this gives an event rate of less than 1 kg"! 
day~!. This is about 106 times lower than the ambient rate from background 
recoils due to gammas from the surrounding natural radioactivity. Nevertheless, 
it is feasible to distinguish between the two because the rate of energy loss 
with distance (dE/dx) is a factor of 10 lower for nuclear recoils. However, 
any background neutrinos, produced by cosmic-ray muons for example, produce 
nuclear recoils that are indistinguishable from WIMP recoils. Thus, the detector 
must be shielded from the muons by placing it deep underground. The velocity 
of the earth through the galactic halo varies during the year as the earth orbits 
the sun. This leads to an annual modulation of the dark matter event rate, with a 
maximum each year on 2nd June + 1.3 days when the earth’s motion is aligned 
with the sun’s motion around the galactic centre and a minimum six months later. 
Due to the high inclination of the earth’s orbital plane, this only amounts to a 5- 
7% change in the mean recoil rate. This annual modulation is the signature sought 
by all of the current detectors (DAMA, ZEPLIN-I, EDELWEISS and CDMS). 
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However, it is possible that the much larger low-energy background might be 
subject to other modulating effects. The direction of the WIMP ‘wind’ felt by 
the detectors is strongly peaked in the direction opposite to the solar motion, so 
the recoil directions will be strongly peaked in the same direction. If observed, 
this feature, combined with the annual modulation, would be the most convincing 
demonstration of the existence of WIMPs. 


6.6.1 Neutralino—nucleon elastic scattering 


If WIMPs solve the dark matter problem, they must have some small but finite 
coupling to ordinary matter—otherwise, they would not have annihilated in the 
early universe and would be overabundant today. By crossing symmetry, the 
amplitude for WIMP annihilation into a quark—antiquark pair is related to the 
elastic scattering of the WIMPs by quarks. Thus, we expect that WIMPs will 
have a small coupling to nuclei and may, therefore, be detectable by nuclear 
scattering. The calculations are considerably simplified because the WIMP 
velocity (vy /c ~ 10-3) is extremely non-relativistic. This feature also simplifies 
the conversion of the scattering cross sections from quarks into the scattering 
cross sections from the nuclei making the detector. In the non-relativistic limit, 
the axial vector current XP y, ys X? is just the WIMP spin and it is coupled to the 
nucleon spin. Since the neutralino is a Majorana fermion, it has no vector current 
xtv. x? = 0. So the only other possible term in the effective interaction is the 
scalar x? x? which couples to the mass of the nucleus—in the non-relativistic 
limit the ‘tensor’ current reduces to the scalar. 

At tree level, the axial vector (spin) interaction receives contributions from t- 
channel Z boson and s-channel squark exchange, while the scalar interaction gets 
contributions from t-channel Higgses H, h as well as s-channel quark exchange. 
See figure 6.3. Since the lightest Higgs might be considerably lighter than the 
lightest squarks, its contribution to the scalar interaction could be significant if 
the neutralino state (6.50) has a substantial Higgsino component. Because it 
is proportional to the mass number A of the nucleus, the scalar amplitude will 
dominate for heavy nuclei. For a neutralino that is a pure Bino B, this occurs for 
A 2 20 in the large squark mass limit and this is confirmed by numerical surveys 
of the supersymmetric parameter space where scalar dominance for A > 30 is 
almost always found [27]. 

It is of interest to examine the theoretical implications for the direct detection 
experiments of restricting the parameters of the CMSSM to the region allowed 
by the cosmological and other constraints. This programme has so far been 
undertaken [28] only in the case that A = 0. The LEP lower limit on mp and 
the b — sy data provide upper limits on the cross sections, while the g — 2 data 
provide lower limits, at least if u > 0. In that case, the overall conclusion is that 
the spin-independent cross section os; satisfies 


2 x 107 pb € øs; € 6 x 107 pb (6.62) 
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Figure 6.3. Lowest-order Feynman diagrams for neutralino-quark elastic scattering: (a) 
and (b) contribute to the spin-dependent amplitude; (b) and (c) contribute to the scalar 
amplitude. 


and for the spin-dependent cross section, os p, 
2 x 1077 pb < osp € 1075 pb (6.63) 


If the g — 2 constraint is dropped and jz < 0 is tolerated, then there is no lower 
limit on os;. In general, it is found that os; is relatively large in the bulk region 
but falls off in the co-annihilation tail. There is strong cancellation in os; when 
HL «O0. 


6.6.2 WIMP annihilation in the sun or earth 


If the galactic halo is composed of neutralino WIMPs, the WIMPs have a small 
probability of elastic scattering with the sun and/or the earth. The WIMPs that 
scatter to a velocity smaller than the escape velocity become gravitationally bound 
to that body!. Once captured, the WIMPs undergo further scattering from the 
elements of that body and settle to the core in a relatively short time period. 
Thus, the sun and earth are like (inefficient) cosmological vacuum cleaners, 
constantly sucking in WIMPs that are then stored in their cores. WIMPs that 
have accumulated in this way can annihilate, essentially at rest, to produce 
standard model particles most of whose decay products are absorbed without 
any observable consequences. However, some of the decay products include 
energetic muonic neutrinos (Vy, Vy) that can pass through the sun and earth and 


! For the sun, the escape velocity at the centre is ve = 1354 km s^! and at the surface vs = 
795 km s^! ; for the earth v; = 14.8 km s™!, v, = 11.2 kms. 
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be detected in astrophysical neutrino detectors on earth. The most promising 
technique for detection is via the observation of upward-moving muons produced 
by charged-current interactions of the muonic neutrinos with the rock below the 
detector. Neutralinos annihilate almost always to two-body final states, each 
carrying energy m,, and the decays of these can produce muonic neutrinos with 
energies between m, /3 and m, /2. The lower bound on the neutralino mass is 
conservatively 40 GeV, so the energy of the muonic neutrinos is several GeV. 
The only background is from atmospheric neutrinos produced by cosmic-ray 
spallation. This is a well-modelled and easily subtracted background. 

The first step in calculating the WIMP-induced neutrino rate is to determine 
the annihilation rate l4 in the sun (or earth). This is given by 


Ta = 5CAN* (6.64) 


where N is the number of WIMPs in the sun (or earth) and 


V. 
Ca = (oalvl) —. (6.65) 
Vi 


{aa lvi} is the annihilation cross section multiplied by the relative velocity in the 
limit of zero relative velocity, i.e. the a term of (6.53), and Vj, are effective 


volumes 
3m? T 3/2 
VE ( —P (6.66) 


where T and p are the core temperature and density respectively. The time 
evolution of N is given by 


N2C-cCAN? (6.67) 


where C is the WIMP accretion rate and the second term arises because of the 
depletion caused by annihilations. The two processes equilibriate when Ñ = 0 
and then 

ra = 1c. (6.68) 


In other words, the annihilation rate is entirely determined by the accretion rate. 
One might wonder whether enough time has elapsed for equilibrium to become 
established but, in all cases of interest, it turns out that there has been [29]. 

Thus the next step is to calculate the capture rate C, which is, in turn, 
determined by the elastic scattering of the WIMPs with the nuclei of the sun 
(or earth). We have already noted that there are just two channels for elastic 
scattering: axial (spin-dependent) and scalar (spin-independent). The former 
contributes only to WIMP capture by the sun, since a negligible fraction of the 
earth's mass is in nuclei with spin. The latter contributes to capture by both the 
sun and the earth. 

It is beyond our scope to give any details of these calculations. We merely 
note that it is relatively straightforward to estimate the axial contribution in terms 
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of the elastic (axial) scattering cross section oo for WIMPs on protons. The result 


is 
1 GeV 00 m 
cam = 1.3 x 10551 (——À J(—À3—;)5( 24 6.69 
ax x 5 my 10-49 cm? ) my (6.69) 
assuming a halo density of 0.3 GeV cm? and a dark matter velocity dispersion 
of 5 ~ 270 km s^!. S is a suppression factor with the properties 


3(và.) 
2i?x 
The capture rate by the scalar interaction is much more complicated and requires 
the scattering cross section from several nuclei. We refer the interested reader 
to [30]. 

Once the capture rate and, hence the annihilation rate is known, the 
calculation of the flux of high-energy neutrinos is straightforward. It is given 


by 
de lA dN - 
(E) An R2 » F (i), (C = Vas Vu) (6.71) 


where R is the sun-earth distance, or the radius of the earth, for neutrinos from the 
sun or earth respectively, Br is the branching ratio for annihilation into channel F 
and (dN/dE) yj is the differential energy spectrum for neutrinos of type i at the 
surface of the sun (or earth) expected in channel F at the core of the sun (or earth). 
The cross section for the production of a muon via a charged-current interaction 
is proportional to the neutrino energy, and the range of the muon in the rock is 
roughly proportional to the muon energy. Thus, the rate for the observation of 
neutrino-induced through-going muons is proportional to the second moment of 
the neutrino energy spectrum: 


dN\ 3 
— 72 
I GE). dE (6.72) 


For neutralinos giving a relic density in the range (5.22), this gives a rate for 
upward muons of 


S() 21 S(x) ~ as X — oo. (6.70) 


m m 

Dt ector = 1-65 x 1074 m2 yr^! (s a) S (=) (6.73) 
for WIMPs with only an axial coupling. S is the suppression factor occurring 
in (6.69). As already noted, this is relevant only for neutrinos from the sun. The 
results of the analogous calculation for WIMPs with only a scalar coupling cannot 
easily be summarized. Suffice it to say that fluxes as high as 1077 m? yr7!, the 
current experimental upper bound on the rate, and of at least 1074 m~? yr^!, the 
expected sensitivity of the next generation of km? detectors, can be obtained for 
parameters giving 10 GeV < my < 1 TeV. If m, > 80 GeV, the signals from the 
sun and earth are of comparable strength and the earth's signal is greater when 
my S 80 GeV, see figure 34 in [31]. 
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6.6.3 WIMP annihilation in the halo 


The foregoing proposals for observing WIMPs are the most promising techniques 
currently available. However, WIMPs have other potentially observable effects. 
In particular, their annihilation in the galactic halo can produce anomalous cosmic 
rays [32] which may be distinguishable from the familiar background cosmic rays. 

These background cosmic rays occasionally include antiprotons produced 
by spallation of primary cosmic rays on interstellar hydrogen atoms. The flux 
of such antiprotons cuts off at energies below about 1 GeV, essentially for 
kinematic reasons because the primary cosmic-ray spectrum falls rapidly as 
the energy increases. WIMP annihilation, in contrast, can easily produce low- 
energy antiprotons as a result of hadronization of the decay products. Since the 
background production of antiprotons with energies in the range 100-1000 MeV 
is well understood, it is possible, in principle, to observe the anomalous 
antiprotons, provided that the WIMP mass is not too large. 

Another signal could be the observation of ‘line’ source positrons arising 
from the direct annihilation of WIMPs into an electron-positron pair. Of 
course, there are other sources of positrons arising from the showering of 
other annihilation products but these will have a broad energy spectrum that is 
indistinguishable from the background. Although propagation through the galaxy 
would broaden the line, there are no other sources of such a peak in the energy 
range 10-1000 GeV. Observation of such a peak would give a direct measurement 
of the WIMP mass. Unfortunately (Majorana) neutralino annihilation into an 
ete” pair is helicity suppressed, as previously noted. However, if the neutralino 
state (6.50) contains a significant Higgsino component, the annihilation process 
x?x? > W*W- followed by Wt — etv, will produce a positron with energy 
peaked around m , /2. 

Similarly, WIMP annihilation in the halo into two photons would produce 
a monochromatic line at an energy of the WIMP mass. Of course, since they 
are electrically neutral, there is no direct coupling to photons but equally their 
(weak) interaction with other matter generates a small but non-zero cross section 
for annihilation into two photons via a loop diagram. Estimates of the cross 
section suggest that the signal would be barely observable with current detectors. 
However, cold dark matter predicts cusps in the density in the cores of galaxies. It 
is doubtful whether such cusps are compatible with observations but a (residual) 
peak in the density would assist the generation of a visible signal. 


6.7 Exercises 


l. Verify that the abundance 2x o of cold dark matter X is given by (6.22) and, 
hence, check the estimate (6.28). 

2. Show that the increase in temperature following reheating after the gravitinos 
decay is given by (6.43) and, hence, derive the bound (6.44) on m3,2. 


3. 
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Show that a neutralino with mass m, ~ 100 GeV scattering from a xenon 
nucleus with mxe ~ 130 GeV, with a typical WIMP speed v ~ 270 km s^, 
produces a nuclear recoil energy which is below 100 keV. 


6.8 General references 


We have found the following article particularly useful in preparing this chapter. 


Jungman G, Kamionkowski M and Griest K 1996 Phys. Rep. 267 195 
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Chapter 7 


Inflationary cosmology 


7.1 Introduction 


The inflationary universe scenario was devised by Guth [1] to provide a resolution 
to two major puzzles in the standard model of the universe, namely the horizon 
and flatness problems. As a by-product, inflation also solves problems associated 
with excessive abundances of particle relics. There are two basic versions of 
cosmological inflation which we shall refer to as ‘old’ and ‘new’ (or ‘slow- 
roll’) inflation. In both versions, the universe undergoes a period of very rapid 
expansion driven by a large cosmological constant in the false vacuum. This 
period of inflation ends when the universe has evolved to the true vacuum with 
zero cosmological constant. In the case of old inflation, the universe supercools 
in some high-temperature phase before undergoing a first-order phase transition 
to some low-temperature phase. In the case of new inflation, some scalar field 
rolls in a very flat region of a potential (where the vacuum energy is large and 
positive) and eventually rolls to a minimum of the potential with zero vacuum 
energy (cosmological constant). 

In the first part of this chapter, we shall discuss old inflation and its successes 
and shortcomings. The second part of the chapter contains an exposition of new 
inflation (slow roll inflation.) We shall see that, as well as providing a solution to 
these cosmological problems, slow-roll inflation is capable of accounting for the 
size of the density perturbations in the cosmic microwave background radiation. 


7.2 Horizon, flatness and unwanted relics problems 
We discuss these three puzzles in the standard model of cosmology in turn. 


7.2.4 The horizon problem 


The cosmic microwave background radiation (CMBR) is very homogeneous. 
However, in the standard model, the present universe consists of many regions 
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which were causally disconnected up to the time of recombination of electrons 
and photons after which the photons we now observe as the cosmic microwave 
background underwent no further scattering. The puzzle is how these causally 
disconnected regions could have ended up with the same microwave background 
temperature. 

We can estimate how many causally disconnected regions there were at the 
time of recombination. From (3.117), the proper distance at time t from any point 
to the particle horizon is 

du (t) = 2t (7.1) 


where we have taken n = j for a radiation-dominated universe. In particular, at 
the recombination time ty, 
dy (tr) = 2t,. (7.2) 


(We are assuming that for most of the time from t = 0 up until t, the universe was 
radiation dominated. This is a reasonable approximation because the temperature 
at which the transition from radiation dominance to matter dominance occurs and 
the temperature at which recombination of electrons and protons occurs are well 
within an order of magnitude of each other, respectively 0.37 eV and 0.26 eV.) 
We need to know how many horizon volumes at time t, have expanded to fill the 
presently observable region of the universe (the present horizon volume). Thus, 
we need to know the radius of the region at time f, that has expanded to the radius 
of the presently observable universe. Let the volume of the observable universe 
at the present time fo be Vo(to) and let the horizon volume at the recombination 
time be V, (t.). Since RT is constant, because of conservation of entropy, 


Voltr)  Vo(to) R(t.) Volto) (2) 


T, 


Volt) (toy (ToV 
V) (2) (2) 0 


With the universe matter dominated from approximately the time of 
recombination to the present time, so that 


(7.3) 


In view of (3.117), 


RH) œ 2 (7.5) 
we have 
tx T, (7.6) 
Thus, 32 
Vo(t,) ( x) 4 
—-——xÍi— ~ 3.6 x 10 7.7 
V. (tr) To ( ) 


for T, ~ 3.0 x 10? K and Tọ = 2.73 K. This is the number of horizon volumes 
at the recombination time that expanded to fill the presently observable universe 
(the present horizon volume.) As advertised, this is a large number. 
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7.2.1 The flatness problem 


The present value of Q, the ratio of the density of the universe to the critical 
density, is not more than one order of magnitude different from 1. Since, as in 
section 1.3, 

k 
= H2R2 
the departure of Q from | is a measure of the extent to which the universe is 
curved (when k Æ 0). The value of Q varies with time and we can estimate how 
close to 1 it would have had to have been at earlier times to be as close to | as 
it is today. The conclusion we shall come to shortly is that $2 would have been 
extraordinarily close to | in the early universe to be consistent with the present 
value of Q. 

First, let us study the way in which 2 — | varies as the scale factor of the 
uuniverse R(t) changes. Recalling that 


8-1 (7.8) 


p 87G N 
= .-—— 7.9 
x^ HR? (7.9) 
and combining with (7.8), we recover the Friedmann equation 
8 
H?R? = = GupR* -k. (7.10) 


Assuming a radiation-dominated universe, p is proportional to T^ and, for entropy 
conservation, RT is constant. Thus, we can write 


p=aR™. (7.11) 
Then, using (7.10), 
k 
$—]-4——————. (7.12) 
3nGnaR-? —k 
It is clear, therefore, that Q — 1 as R — 0. However, as the universe expands, 
Q — Oas R — ooifk = —l, and Q — oo as R — Rmax = ($n Gua)? if 
k=l. 
Next, let us estimate the value of 2 — 1 in the early universe. Write 
p - £T* (7.13) 
where, from (2.22), 
ERE PRIM. (7.14) 
30V? 78 ^J i 


Then, from (7.12), (7.11) and (7.13), 
k k 
Q-l= 


= c a (7.15) 
$nGvET*R?—k — $nGy£T? 
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for large values of T, where 


(7.16) 


In an adiabatically expanding universe, the constancy of RT implies that kisa 
(dimensionless) constant. We can estimate k from (7.15), (7.13) and (7.10) as 


&rGwp _ (Qo — 1)Hj 
aT? R 


(7.17) 


where Ho and To are the present values: Ho ~ 1.54 x 10-42 GeV and To ~ 
2.73 K ~ 2.35 x 107? GeV. For Qo differing from 1 by no more than an order 
of magnitude, 


k| € 2 x 10758, (7.18) 


We are left with an unnaturally small number for lk (unless k is strictly zero). 

This bound on i&l can now be translated into a bound on |Q — 1| at early 
times. By way of illustration, we take the value of & obtained in an SU(S) 
supersymmetric GUT and estimate the value of |Q — 1| at the grand unification 
scale and at the Planck scale. In this case, as in section 2.7, Ng + aNr = $15 and 
so 


£ = 55.5. (1.19) 


Recalling that Gy = mp, where the Planck mass mp — 1.22 x 10!? GeV, we 
get the bound at the grand unification scale, 7. = 2 x 10!6 GeV, 


IQ — 1| € 1.66 x 10755 (7.20) 


and at the Planck scale 
IQ — 1| € 1.66 x 1076! (7.21) 


Again, these are unnaturally small numbers (unless k is strictly zero). The 
problem is to find a way that conditions in the early universe could have produced 
such small numbers. 


7.2.3 The unwanted relics problem 


It is not infrequently the case that particles produced in the early universe are 
calculated to have unacceptably large relic densities in the present universe, either 
because they provide too large a contribution to the mass of the universe or for 
other reasons. For example, as discussed in section 3.10, unacceptably large 
monopole densities are produced in some GUTs. A mechanism is needed to dilute 
these densities to acceptable values. 


Old inflation 199 
7.3 Old inflation 


A solution to all three problems discussed in the last section is for there to have 
been a period of very rapid expansion of the universe (cosmological inflation) 
during which the scale factor of the universe grew by a large amount. We shall 
discuss shortly how much expansion is sufficient to solve these problems. A 
simple mechanism to produce the required expansion is for the universe to have 
supercooled in a false vacuum prior to undergoing a first-order phase transition to 
the true vacuum in the way described in section 2.9. In that case, a large positive 
vacuum energy, constant until the phase transition is completed, can drive a period 
of expansion in a de Sitter universe. After some supercooling has occurred, the 
vacuum energy density in the Friedmann equation will dominate the radiation 
energy density and the curvature term, and the Friedmann equation simplifies to 


R?  81GN 

Hil—- V 722 

R? 3 (7.22) 

where V is the vacuum energy density in the false vacuum. This is equivalent to 
the de Sitter equation with cosmological constant 


A —81GyV = 81mp V. (7.23) 


Moreover, the Hubble constant during the inflationary era has a constant value 
given by 
H? — 15. (7.24) 


During cosmological inflation, the scale factor of the universe grows 


exponentially 
R(t) x exp (/s) =e", (7.25) 


The exponential expansion means that by the time the transition to the true 
vacuum occurs, the scale factor of the universe may have increased by many 
orders of magnitude. The phase transition will be completed by the formation 
of bubbles of the true vacuum, as discussed in section 2.9. Once formed, the 
bubbles will tend to coalesce and the energy stored in the walls of the bubbles 
will be released resulting in the universe reheating. Thereafter, the universe 
will evolve as a (in the first instance) radiation-dominated Friedmann—Robertson— 
Walker (FRW) universe. However, the initial conditions for the evolution of the 
FRW universe will have been drastically modified by the period of inflation. If 
sufficient inflation has occurred, the various problems discussed in the previous 
section will be solved. We now estimate how much inflation is required for this 
purpose. 

Consider first the horizon problem. This problem will be resolved if the 
presently observable universe lies in a single region which was causally connected 
at the time of decoupling of photons from matter (the recombination time), rather 
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than containing of the order of 3.6 x 10* such regions as in section 7.2. The usual 
expression (3.117) for the distance to the particle horizon does not hold during 
the period of cosmological inflation. Instead, 


t dt’ 
dy(t) = R(t) Í ED (1.26) 


with R(t) given by (7.25) if we neglect the period of growth of dy (t) during the 
period of radiation-dominated expansion which preceded cosmological inflation. 
Thus, during the inflationary period, 


dy(t) = H^ (e”' — 1). (7.27) 


The exponential growth of dj (t) during this period means that, once the universe 
has reheated to around the critical temperature after the phase transition has 
been completed, the horizon volume is exponentially greater than it was at this 
temperature prior to inflation. (Because the energy density in the false vacuum 
is of order TA and the radiation density in the FRW universe is given by (2.22), 
reheating to a temperature of order T, occurs.) Thus, after the subsequent period 
of radiation-dominated expansion in the FRW universe, the size of the horizon 
volume at t — t, is exponentially greater than in the standard mode! of cosmology. 
It is then easy for one horizon volume at t = t, to contain many times over the 
volume which will expand to the presently seeable universe. 

Consider next the flatness problem. Let T = Te be the temperature at 
which the low-temperature minimum of the effective potential (the true vacuum) 
becomes the absolute minimum. When the phase transition is first order, the 
universe will supercool to a temperature T = T; before the phase transition is 
completed by tunnelling out of the false vacuum, as described in section 2.9, 
and reheating of the universe to a temperature T = Tg occurs, with Tr ~ To- 
The period when supercooling is occurring is a period of non-adiabatic expansion 
which modifies the discussion of the flatness problem given earlier. The flatness 
problem was cast in section 7.2 as the unnatural smallness of |Q — 1| = |k/ H?R?| 
at early times, e.g. at the (supersymmetric) grand unification scale. During 
inflation, assumed to occur at that scale, H? is given by (7.24) and is constant. 
At the same time, R grows exponentially. In (7.20), | — 1| was of order 10755. 
Thus, if R? grows by more than about 55 orders of magnitude during inflation, 
we end up with a ‘natural’ value of |Q — 1| of order 1 at the supersymmetric 
grand unification scale. It is usual to measure inflation in terms of e-folds (one 
e-fold being growth of R by a factor of e). In terms of e-folds, what we require to 
overcome the flatness problem is around 64 e-folds of inflation! This is sufficient 
to solve the horizon problem discussed above. 

Finally, turning to the unwanted relics problem, let us consider, for 
definiteness, the magnetic monopole problem. In section 3.10, the excessive 


1 The WMAP data which suggests that [£o — 1| may be two orders of magnitude less than 1 indicates 
66 e-folds may be nearer the mark. This makes little difference and we shall use 64 e-folds throughout. 


New inflation 201 


magnetic monopole contribution to the predicted density of the universe today 
derived from the size of the ratio of the monopole number density to the entropy 
density at the time of the grand unified phase transition at which the monopoles 
were produced or, equivalently, from the size of the ratio of the number of 
monopoles to the entropy. The problem can be solved if a great deal of entropy is 
generated by inflation. 

During supercooling, the entropy does not change. However, the non- 
adiabatic reheating results in increased entropy. The entropy density prior to 
supercooling is of order 72. If the reheating temperature Tg ~ Te, the entropy 
density is still of this order after reheating. However, because the volume of 
any region has been inflated by the inflation of R?, the entropy in that region 
has increased by a factor of e744‘, where Ar is the duration of the period of 
exponential expansion. If there is sufficient inflation to solve the flatness problem 
(about 64 e-folds of inflation), then the entropy increases by a factor of 2.4 x 108. 
In section 3.10, we found that for the case of a supersymmetric grand-unified 
phase transition, 24A? was 18~19 orders of magnitude greater than the predicted 
upper bound for Qh? (and a few orders of magnitude more in the case of a first- 
order phase transition). Since the entropy generation resulting from inflation 
reduces Qh? by 83 orders of magnitude, the relic monopole density today is 
insignificant. A similar discussion applies to other particle relics. 

For all its successes, old inflation has a fatal flaw. It is not possible to make 
a ‘graceful exit’ [2] from the period of inflationary expansion in a supercooling 
de Sitter universe to a reheated FRW universe. The problem arises because the 
phase transition is completed in the way described in section 2.9 by the formation 
of bubbles of the true vacuum inside the false vacuum. In the first instance, the 
vacuum energy of the de Sitter phase emerges as energy in the bubble walls. For 
the universe to thermalize, it is necessary for the bubble walls to undergo many 
collisions with other bubble walls. The trouble is that, on the one hand, sufficient 
inflation requires the nucleation rate for the true vacuum to be sufficiently low 
to allow a long period of supercooling. On the other hand, if bubbles of true 
vacuum are to form sufficiently rapidly for the bubbles to overlap and collide in 
an expanding universe, then this same nucleation rate needs to be sufficiently high. 
It turns out that these two requirements cannot be reconciled. More precisely, it 
is found that for nucleation rates low enough for sufficient inflation the universe 
always consists of clusters of bubbles of true vacuum with a few bubbles in each 
cluster surrounded by false vacuum. 


7.4 New inflation 


It is possible to retain the successes of old inflation while avoiding the graceful 
exit problem in an alternative formulation of cosmological inflation referred to as 
"new" inflation [3-5] or ‘slow-roll’ inflation. The graceful exit problem derived 
from the slow rate of bubble formation at a first-order phase transition when 
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Figure 7.1. Slow-roll inflation. The slow-roll region is between $; and $ y. 


the nucleation rate was sufficiently low to allow the universe to remain in the 
false vacuum long enough for sufficient inflation to occur. In new inflation, 
the inflationary period begins with the scalar field (expectation value) $, the 
‘inflaton’, which, for the time being, we shall take to be real, in a region of the 
effective potential V($) which is very flat. The scalar field may have reached 
this region by tunnelling through a barrier between a false vacuum and the true 
vacuum or by the false vacuum having ceased to be a local minimum as the 
temperature dropped. The scalar field is then assumed to roll slowly down the the 
flat region of the potential. We shall discuss the scalar field dynamics involved 
shortly. While this process is occurring, the value of the potential is positive 
and can drive inflation. If V (d$) is sufficiently flat in the relevant region, the 
inflationary process can last long enough to solve the cosmological problems 
discussed earlier. Eventually, $ reaches a steeper region of the potential, descends 
more rapidly towards the absolute minimum of V(¢) with V = 0, overshoots 
and starts to oscillate about the absolute minimum. Quantum mechanical particle 
creation damps the oscillation and converts the vacuum energy into the energy of 
particles. Thermalization of the emitted particles creates a radiation-dominated 
FRW universe. The whole process is displayed in figure 7.1. We shall now discuss 
each stage of the process in more detail. 

To study the slow-roll stage, we require the equation of motion for (the 
expectation value of) the scalar field $. The Lagrangian density for a (real) scalar 
field with effective potential V ($) is 


L = 53,63" — V ($). (7.28) 


(Then the action is $ — f d*x /—g£.) One way of deriving the equation of 
motion is as the covariantized Euler-Lagrange equation for this field, namely 


D, (9^9) = —V'() (7.29) 
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with the covariant derivative defined by 
D,V" = aV” c TI V^ (7.30) 


for any 4-vector V^. Assuming a homogeneous field $, so that the spatial 
gradients are zero, 
Vó — 0, (7.31) 


the Euler-Lagrange equation reduces to 
$ +Ti o + V'(9) — 0. (7.32) 


With the coefficients of affine connection for the Robertson-Walker metric as in 
section 1.2, the explicit equation of motion is 


$ó -3Hó 4 V'($) 20 (7.33) 


where H is the Hubble ‘constant’ (exercise 1). If there is a range of values of 
$ for which slow roll occurs, then in that region the motion is dominated by the 
‘frictional’ term 3H $ and the ¢ term is neglected. Then the equation of motion 
simplifies to 


ó2-— (7.34) 


F 
—|<«}. 7.35 
lcs (735 
The double time derivative ó may be estimated from (7.34) as 
M" ] : 1 - 
=-—V' -H ?HV(Q). 7.36 
$ 3H (6)ó + 3 ($) (7.36) 


An estimate of H is now required. When the vacuum energy density py 
dominates over the radiation density and the curvature terms, the Friedmann 
equation gives 

We 8xGyw 8x 


-2 
z Pv = MP Pv. (7.37) 


Also, the energy-momentum tensor for the scalar field is given by 


aL ad 
w = Sang ax SHE 0.38) 


= 9,00, — 19,,8,00^$ + guv V (0). (7.39) 
For a homogeneous field $, the vacuum energy density py is given by 


py = To = 39? + V(9). (7.40) 
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If we assume that the energy density is dominated by the potential energy, then 


H? = meV). (7.41) 


We shall see shortly that this is a consistent approximation when the effective 
potential V (¢) is flat enough to satisfy the conditions for slow roll. Returning to 
(7.36) with H estimated from (7.41), we see that 


š 


3H 78 sui v6 E yv (@)y. (7.42) 
To satisfy (7.35), we require that 
y’ 
| 9 a | <1 (7.43) 
and Bam? 
TE VH «1. (1.44) 
With H? given by (7.41), these slow-roll conditions are 
2 / 
Tu < 24r (7.45) 
and 2 
2 (Te) « 487. (7.46) 


Slow roll occurs in the range of $ for which V(¢) is flat enough to satisfy these 
two conditions. 

It can now be seen that, when the slow-rol! conditions are satisfied, the 
vacuum energy density is dominated by the potential energy. Using (7.34), the 
kinetic term in (7.40) is 


1:9  (V'($) 
5? = 9H (7.47) 
so that, using (7.41) and (7.46), 
152 2 , 2 
39^ m$ (V'(9) ) 
V($) ^ 96x (vio <l. (7.48) 


If the slow roll occurs between times 1; and ty, and the value of the scalar 
field evolves from ¢; to f during this time, then the amount of inflation, 
measured as the number of e-folds, is given by 


tR ty 
Ne = In — = Í —dt= H dt. (7.49) 
ti R 
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Thus, when ¢ is given by (7.34), 
R of d ?r d 
Ne = In =£ -f H% = -f 3p? oo (1.50) 
Ri $i $ éi v'(o) 
and when the vacuum energy density is dominated by the potential energy, as in 
(7.41), the number of e-folds of inflation is 


Ry 2 [" V($) 
Ne = In — = —8 — — dọ. . 
e n R; amp a VÆ $ (7.51) 
If we make the approximation 
V'($) = Vi) + V” (6))(6 — Gi) (7.52) 
and take 
V'i) ~ 0 (7.53) 
for a flat potential, then 
V'($) = V"($i)($ — $i). (7.54) 
Substituting this into (7.34), we find that 
y" . 
$ — $i = exp (55e — 2) . (7.55) 
Then, the motion is slow over a time period 
3H 
t~ ———— (7.56) 
IV" (Pa) 
and, using (7.49), 
Ry 3H? 
N: = In — ^ Ht ~ ——_.. (7.57) 
t Ri IV" ($i) 
With H? given by (7.41), this gives 
In Ry ~ _ 82 VG) (7.58) 


Ri mV") 
Thus, when the slow-roll condition (7.45) is satisfied, In Rf/R; is large and 
we get many e-folds of inflation. Equation (7.58) is useful as an initial test of 
whether sufficient inflation can occur. The estimate of the number of e-folds of 
inflation can be sharpened up by performing the integration in (7.51) over the 
region between $; and @¢ which are the boundaries of the region in which the 
slow-roll conditions are satisfied. 

All of this discussion assumes that the motion of $ across the flat region 
is that of a classical field. If there are significant quantum fluctuations, $ may 
cross the flat region more rapidly and these conclusions may no longer be valid. 
We shall see later that, in the de Sitter space of an inflating universe, there are 
substantial quantum fluctuations and we need to check that this effect is not 
sufficient to invalidate these estimates of N,. This will be discussed in section 7.6. 
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7.5 Reheating after inflation 


Eventually, slow roll ends when the inflaton field $ reaches a steeper region of the 
potential. The inflaton then descends more rapidly towards the absolute minimum 
of the potential, overshoots it and starts to oscillate about the absolute minimum. 
Assuming that the inflaton possesses couplings to matter fields, the oscillation is 
damped by quantum mechanical particle creation as vacuum energy is converted 
into energy of particles [6-9]. Denote the decay rate of the inflaton by l'y. We 
shall assume that l'5 < Hosc, where Hosc is the value of the Hubble constant when 
the slow-roll period ends and the oscillating period begins. 

A simple way of introducing the damping by particle emission into the 
dynamics of the inflaton is to modify (7.33) to 


$ 3Hó * T49 4- Vo) — 0. (7.59) 


Multiplying by $/2 and recalling (7.40) for the vacuum energy density py, we 
find that 
bv + (3H 4 T4)é? — 0. (7.60) 


For simple harmonic oscillations, the average of the kinetic energy over an 
oscillation is equal to the average of the potential energy over an oscillation, so 
that 


32) = (V(9)) = 3v) (7.61) 
Averaging (7.60) over oscillations, we write 
pv +BH+Tyg)pv =0 (7.62) 


where py is now understood to refer to the time-averaged quantity. We shall 
discuss later in this section the circumstances in which (7.62) is valid. 

While t < rs. neglecting the time between the big bang and the start of 
oscillations, the development of py is given, to a good approximation, by 


py +3Hpy =0 (7.63) 


which is identical to the energy conservation equation in a matter-dominated FRW 
universe. Thus, we may regard the vacuum energy density as equivalent to a gas 
of non-relativistic $ particles. During this period, 


pv x R? Rar’ and pv « 172, (7.64) 


When t ~ T; , rapid decay of the vacuum energy to emitted particles occurs 
and the universe reheats to a temperature Tg given by 


x Ng Nr) TÉ = py(t - T7!) 7.65 
30 NET g^F)'n7»€u-t, (7.65) 
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where Ng + &NF = 215 for the supersymmetric standard model and 2 for the 


standard model. If we define a scale M by 

M* = py(t — ti), (7.66) 
where 1; is the time that slow roll begins, then we still have 

pv (t = tosc) = M4 (7.67) 
because py ~ V ($) during slow roll and V ($) does not change significantly over 


the flat region in which slow roll occurs. The initial value Hos; of the Hubble 
constant when the oscillatory period starts is then given by 


Hl. = np Mt. (7.68) 
The time tos; when the oscillatory period starts is then of order 


- 3 - 
fos ~ Hol = iz" pM? (7.69) 


(or one or two orders of magnitude greater). Then, from (7.64), we see that 


_ r-l 
pv(t T4) 


= (lotos)? (7.70) 
Dv (t = tose) ? osc) 
and, using (7.69) and (7.67), we have 
(t =15') = (Tomp) (1.71) 
pvi = $ = Ba gmp). . 


Consequently, the reheating temperature Tr of (7.65) is 


45 1/4 
Tg = | ——— lom p)! 2. 7.72 
R (atop) (Tomp) (7.72) 


Note that this is not, in general, of order M. 

This discussion depends upon (7.63) for the time development of the (time- 
averaged) vacuum energy density. This is known to be correct for the oscillatory 
period if the $ particles decay only into fermions. However, when the ¢ particles 
decay into pairs of bosons $ — x x, then it is possible for very rapid decay via 
parametric resonance to occur [10], a process which generates very large numbers 
of x particles. This is referred to as ‘preheating’. However, the (radiation) energy 
density in light x particles rapidly becomes small compared to the (matter) energy 
density remaining in the oscillatory ¢ vacuum. Thereafter, the reheating process 
occurs as before and the estimate of Tg is not much altered. 
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Since all pre-existing baryon asymmetry will be diluted exponentially by the 
inflationary period, the baryon asymmetry we observe now must be generated 
after reheating has occurred or during reheating. If the reheating temperature 7g 
is sufficiently high, then the baryon asymmetry may be produced in the usual way 
by the decay of leptoquark bosons in a GUT. Lower reheating temperatures will 
suffice if the sphaleron mechanism applies instead. A further possibility is that 
the baryon asymmetry is produced by the decay of the oscillating vacuum state 
which exists after slow roll has ceased, i.e. by the decay of particles associated 
with the inflaton field $. This is the situation discussed in section 4.6 where all of 
the entropy of the universe is produced by the decay of particles whose decay is 
also producing the baryon asymmetry. Then the baryon asymmetry is 


— M — (9.73) 


where e is the net baryon number produced by the decay of a scalar particle 
associated with $. There is the weaker requirement that the reheating temperature 
should be high enough for nucleosynthesis to occur so that Tg should be at least 
a few MeV. 


7.6 Inflaton field equations 


As discussed in section 7.4, estimates of the amount of inflation occurring during 
slow roll require that quantum fluctuations in the inflaton field $ do not cause the 
flat region of the potential to be crossed too rapidly. We now show that, in the 
inflationary universe, ($2) grows linearly with time [11-13]. 

The inflaton field operator may be expanded in terms of plane-wave modes 


as 
n 21 3 ix 
o(t, x) = ani IE k (Vk (re * ay + h.c.) (7.74) 
where the creation and annihilation operators ay and al obey 
[as af,] = 8(k — k’). (7.75) 
For a massless field in a flat FRW space, the field equation 
D, (@"¢) = 0 (7.76) 
leads to . . 
Vk) + 3H vat) + R^ (Ok Val) = 0. (7.77) 
With 
R(t) = Roe! (7.78) 


during the inflationary expansion and using the variable 


n = —R3'H le (7.79) 
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the equation for the plane wave mode yg (t) becomes 
Vp -30 V, tK y, =0 (7.80) 


where the primes denote differentiation with respect to n. The general solution is 
given in terms of Hankel functions 


maA1/2 
val) =(F) PP HLS EN) + cx HD EM] 08) 
where k = |k| and the condition 
Ica? — lei? = 1 (7.82) 


follows from the canonical commutation relations for ó and its conjugate 
momentum. Retaining only the positive frequency part [13] of y/y (n), for modes 
which go through many oscillations in an expansion time, we take 


c2(k) = I, ci(k) 20 (7.83) 
for k >> RoH. Then, fork > RoH, 
n \1/2 
ve) = (2) HHE) (7.84) 
~ — (2k) up — ik eth. (7.85) 
and 
, R? 2p-2]-2,-2HI 
Iv GI = al +k Ry H “e ) (7.86) 


The quantum fluctuation ($2) may now be estimated as follows. Because the 
modes with wavelengths greater than the horizon (in the sense of the comoving 
Hubble length H ^!/R(t)) are expected to be responsible for the growth of 
(9?) with time [13], an approximation to ($?) is obtained by cutting off the k 
integration at k = Ro He". Then. from (7.74) and (7.75), 


l 
2 3 2 
= d'k 7.87 
W= uy J TAO (1.87) 
and, using (7.86), this gives linear growth in time: 

H? 

(92) x —;t + (constant). (7.88) 
4n 


It is important for a consistent model of inflation that quantum fluctuations do not 
result in ($) crossing the flat region of the potential faster than the time required 
for semi-classical slow roll across this region. In (7.57), the time to roll across the 
flat region (the period of slow roll) was 


t~H'N,. (7.89) 
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Thus, we require a flat region of width A4 with 


H? H?N, 
(^9) > aD ^a (7.90) 
so that we require ; 
1/ 
HN, 
Ad > 2m (7.91) 


We shall see in the example in section 7.8 that there is often a stronger constraint 
from the rquirement of obtaining density perturbations of the size found by 
COBE. 


7.7 Density perturbations 


The quantum fluctuations in the inflaton field discussed in the previous section 
result in density perturbations [14—16] in the post-inflationary universe, which 
may be responsible for galaxy formation. The density perturbations arise because 
the quantum fluctuations in ó give $, i.e. the expectation value of ĝ, slightly 
different values in different regions of space. This results in perturbations to the 
value of the vacuum energy density. 

Central to the discussion of the formation of density perturbations is the fact 
that a given comoving wavelength (i.e. a wavelength in units of the scale factor 
R(t) of the universe) can start inside the horizon before inflation begins, cross 
outside the horizon at some time during inflation, and then cross back inside the 
horizon after inflation has ended and a radiation-dominated universe has been 
established. (By ‘horizon’ we shall mean here not the particle horizon but the 
comoving Hubble length H ^! / R(r). This is a measure of the distance light travels 
during an appreciable amount of expansion of the universe. Inside of a comoving 
Hubble length, causal processes do not feel the expansion of the universe.)) This 
behaviour is a consequence of the fact that when R(t) is increasing as a power t? 
of t with p < 1, the comoving Hubble length increases with time, whereas when 
R(t) is increasing exponentially with time, the comoving Hubble length decreases 
with time, while the comoving wavelength is, by definition, constant. (See 
figure 7.2.) The (classical) inflaton field perturbation 8$ (t, x) may be expressed 
in terms of perturbations 8$ (t, k) of momentum k as 


8$(t, x) = J Pkel** ad (r, k) (7.92) 
where we have written the complete inflaton field $ (t, x) as 


$ (t, x) = polt) + S(t, x) (7.93) 


with @o(t) the homogeneous classical field. Quantum fluctuations 6$ (t, k) 
develop when the comoving scale |k]! / R(t) is inside the horizon and become 
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Figure 7.2. Behaviour of the comoving Hubble length during and after inflation. 


‘frozen in’ when this comoving scale crosses outside the horizon, so that it is no 
longer subject to causal processes. When this comoving scale crosses back inside 
the horizon, the fluctuations reappear as classical density perturbations. 

Calculation of à$(t, k) requires a generalization of the equation of motion 
(7.33) of the homogeneous classical field $o(t) to include the spatial dependence 
of $ (t, x). Including this dependence (7.29) leads (exercise 2) to 


@ —R?V?$ -3Hó + V'($) 20 (7.94) 


where a flat space has been assumed. (Note in passing that had we allowed 
$ to have spatial dependence in the discussion in section 7.4, this would have 
been damped out rapidly because of the exponential growth of R(t) during the 
inflationary period.) Comparing (7.33) for $o(t) with (7.94) for @(t, x), we see 
that 5$ (t, k) obeys 


(86) + 3H (69) + Rs ^e?" k S$ + V" (o)3 = 0. (7.95) 
For slow roll away from a maximum of the potential, we must have 
V"($o) « 0. (7.96) 


The perturbations start to grow when the fourth term in (7.95), which is the 
destabilizing influence, becomes larger than the third term. Thus, d@(r, k) starts 
to grow at a time t*(k) (where k = |k]) given by 


Ry2e 2H k? = —V" (gp). (7.97) 
For t 7» t* (k), the third term in (7.95) can be neglected and ô$ obeys 


56 + 3H8d = —V" ($959. (7.98) 
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We also know that œo obeys (7.33): 
do + 3H do = —V' (oo). (7.99) 
Consequently, ; 
555 (99 + 3H Žo) = V (Hodo) (7.100) 


so that à obeys the same equation as $o when H is nearly constant, as it will be 
during the period of slow roll. Thus, 6¢(t, k) must be proportional to ġo with a 
k-dependent constant of proportionality which we write as —8t (k): 


6$ (t, k) = —6t(k)óg(0). (7.101) 
Substituting into (7.93), 
b(t, x) = polt) — ôt (x)óo(t) (7.102) 
where 
ér(x) = f d'k ef * 5z (K). (7.103) 
To first order in ôr, 
P(t, x) = dot — óv(x)). (7.104) 


Thus, the scalar field fluctuations introduce a spatial dependence into the classical 
field $o(t) which is of the form of a spatially-dependent time lag. Also, for 
t X 1*, the V” (ġo) term may be neglected and 8¢ is just the quantum fluctuation 
of a free massless scalar field in de Sitter space, which is known to be 


; H 
8$, k) = Mv; Ry 2k H 2e 2H1)/2. (7.105) 


In this limit, the equation obeyed by 3ó(t, k) is identical to (7.77) and 5@(t, k) 
is the same as |V| up to a normalization factor, as can be seen from (7.86). 
The normalization is determined by the requirement that 8$ (t, k) is the rms 
fluctuation [14], so that 


. k X 
àó(t, k)? = (z:) TAR (7.106) 


Perturbations in a scalar field will produce density perturbations because the 
potential energy V ($) is modified by perturbations in $. Thus, 


óp = ôV = V'(go) 50. (7.107) 


A calculation of the evolution of the density perturbations using the formalism 
of Olson [14, 17] shows that when the comoving scale |k|~? / R(t) crosses back 
inside the horizon, 


5 
" = AHót(k). (7.108) 


Density perturbations 213 


We may estimate à (k) by assuming that both the t >> 1*(k) and t. « t"(k) 
expressions for 5@(t, k) are tolerable approximations for £ œ t*(k). Equating 
(7.101) and (7.105) at t = t* (k) gives 


-H * 
M _ -2,2 jg -2, 2 H1* 1/2 
ót(t*, k) Andro) + Ry k He ) 
—H 
= ———_[1 + HV" (ty. 7.109 
eS ($xr*)] (7.109) 
When the slow-roll conditions are satisfied, 
V"($o) « H? (7.110) 
and we have H 
8z(t*, k)  ————. 7.111 
( ) ande) ( ) 
Then 2 
ô — x 
p ZE (7.112) 


p nien) 
In practice, t*(k) is a few Hubble times after the time when the comoving 
wavelength k^! crossed outside the horizon. In evaluating (7.112), we 
shall always identify ¢*(k) with the horizon crossing time given by k = 
R(t* ()) H (t* (k)). Then (7.112) is consistent to within a factor of order one with 
other, more rigorous, treatments [15, 16]. 
The COBE observations require that 


êp 2 x 1075. (7.113) 
p 


This allows us to estimate the energy scale of the inflationary potential V (¢). 
Using (7.41) and (7.47), we have 


õp — Vg)? 
— ^m . (7.114) 
p P V'(9) 
Then, using (7.113), it follows that 
mpV'($) VP. i6 iar 
V (9)!/4 ~ (e 101 10!7) GeV. (7.115) 
? Vio) ( 
Also, from the slow-roll condition (7.46), 
m p V'(d) ) 
——— « 3.5. 7.116 
( V) 019 
Thus, we expect the energy scale of the inflationary potential is given by 
V($)"* ~ (1016 10!7) GeV (7.117) 


though it could be less if the inflationary potential is very flat. 
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7.8 A worked example 


The discussion of previous sections may be illustrated by the following example 
of a suitable effective potential for inflation due to Steinhardt and Turner [5]. 
Consider the potential 

V = Vo- B9) +g’ (7.118) 


where Vo, $ and A are constants with B, A > 0. [tis assumed that the inflaton field 
$ starts rolling from @ = 0, possibly because $ was zero in a high-temperature 
phase where some symmetry was restored. In that case, slow roll began at 
some lower temperature after the high-temperature phase ceased to be the stable 
vacuum. It is also assumed that V is zero at the absolute minimum to which $ 
rolls, corresponding to zero cosmological constant. At the absolute minimum, 


o= 3 = (7.119) 
and V (o) = 0 requires that 
Vo = DP (7.120) 
When $ = ġo = 0, (7.41) then implies that 
H? = B = rE mz, (7.121) 


The slow-roll condition (7.45) implies that slow roll occurs for @ in the range 


= ge (7.122) 


where it has been assumed that ¢ < o in the slow-roll region. The second slow- 
roll condition (7.46) is automatically satisfied whenever 


V"($) ~ UE (7.123) 


and |@| is at least one order of magnitude less than m p. This is true here because 


-2V'($) 
$ 
when ó «& o < mp. It is being assumed that ø is less than m p so that we do 


not have to consider the effects of quantum gravity. Following (7.119), this is 
obviously arranged for 


V"($) = (7.124) 


B Smp. (7.125) 
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To avoid de Sitter fluctuations driving @ across the flat region faster than 
it would roll semi-classically, we need a width A4 for the flat region with the 
property (7.91). Combining this with (7.122) leads to 


> IL /2 
B sau Ne mp. (7.126) 


The criterion for sufficient inflation (7.57) yields 
324172 
B > 128 (=) Mp (7.127) 


where |V"(ó;)| has been evaluated at $; ~ Ho instead of $; ~ 0 to allow for 
quantum fluctuations and N, has been taken to be about 64. The condition (7.126) 
is clearly satisfied when (7.127) is. 

For density fluctuations of the order required by COBE, (7.112) and (7.113) 
imply that 

=H") 
x33 óo(t*) 

With the aid of (7.34), this leads to 


^ 1075, (7.128) 


105 H? (1,) 


~ xB) (7.129) 


if t* ~ te, the time at which slow roll ends. With ġe given by (7.122) and 
H? ~ =P Vo (7.130) 


using (7.41), and assuming that B/Am p S 1 (which will turn out to be the case), 
we then find that 
B~8.5 x 10°A3/2mp. (7.131) 


In practice, this is not a particularly good approximation because @ is not constant 
and do(t*) < do(te). A more careful calculation [5] gives a value of B several 
orders of magnitude larger. Note that even for the smaller value of 6 given by 
(7.131), the constraint for sufficient inflation (7.127) is satisfied with two orders 
of magnitude in hand and the de Sitter fluctuation constraint (7.126) is satisfied 
with four orders of magnitude in hand. Combining (7.131) with the condition 
(7.125) for ø to be less than m p, we find that 


à < 2.5 x 1078, (7.132) 
Then (7.131) implies that 
B - 
<P £13 (7.133) 


so that B/Amp < 1, as assumed earlier. 
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7.9 Complex inflaton field 


In previous sections, we have been assuming that the inflaton field $ is a real 
scalar field. A simple extension is to take $ to be a complex scalar field [18] with 


the Lagrangian density 
L = 8,09^$* — V($, $^). 


For a homogeneous field, the Euler-Lagrange equations are then 


6+3Hb+—~ = 
age — 
with the Hubble constant given by 


H? = ens! V (6. 99. 


When the energy is dominated by the potential energy, 


where we have separated ¢ into its real and imaginary parts 
$ = 6| + i$». 

For the slow-roll approximation to be valid, 

ES 


—| <1 
3H 


m «1. 
3H, 

These may be cast as the sufficient conditions (exercise 4) 
Vu Vi + V2Vi2 
VVi 
V22 V2 + Vi Vi2 

VV; 
Vi v2 
y2 


2 


m?, « 487 


2 


(7.134) 


(7.135) 


(7.136) 


(7.137) 


(7.138) 


(7.139) 


(7.140) 


(7.141) 


(7.142) 


(7.143) 


(7.144) 


where V, = 0V/d¢dq(a = 1,2) etc. The last condition ensures that the kinetic 
term may be neglected compared with the potential term in the vacuum energy 


density. 
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If slow roll occurs from ($1, $2) = (19, $29) close to a saddle point or 
minimum of the effective potential, then we may write 


Vi = Vio + Vito (G1 — Pig) + Vias (do — 29) (7.145) 
V2 > Vo, + Vi29 ($1 — Pig) + Voz (do — $29)- (7.146) 


In terms of the displacements, 
Xa = $a — Pao (a = 1,2) (7.147) 


and, correct to linear order in xg, the slow-roll equations may be written as 


X -A-MX (7.148) 
with 
e) ex) om 
end 1 ( Vib Vi% 
M= a Vp, V3. ) (7.150) 


After diagonalizing M, we find that the displacements x4 are superpositions of 
eigensolutions with time dependence e~*’ where 


= oe ( = 1,2) (7.151) 


2012 = Vito + V22 + Y (Vito — Va29)? + 4V2,- (7.152) 


The number of e-folds of inflation may then be written as 


with 


Ne = 2Vomin(—y7', -u3 ') (7.153) 


if jz; and u2 are both negative. Otherwise, Ne is controlled by the negative jz;. It 
is now necessary to have the potential sufficiently flat in all directions that there 
is slow roll no matter what direction of roll occurs off the maximum (or saddle 
point). 


7.10 Chaotic inflation 


Up to this point, it has been assumed that the initial conditions for slow-roll 
inflation are thermal. By this we mean that the field $ was at the minimum of 
the effective potential for a high-temperature phase until this minimum ceased 
to be the absolute minimum. Thereafter, œ appeared in the flat region of 
the potential, either by quantum mechanical or thermal tunnelling out of the 
metastable minimum or after the metastable minimum had ceased to exist. If 
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the flat region is in the vicinity of a maximum or turning point of the effective 
potential, some fine tuning of the initial conditions may be required if $ is to start 
out in the flat region. 

An alternative possibility [19] is that the initial conditions are provided by a 
chaotic quantum state which existed for times t < tp, where 


tp = mp! (7.154) 


is the Planck time. A simple chaotic inflation model can be developed using the 
Lagrangian density for the inflaton field $, 


L = 49,¢0"¢ — Vid) (7.155) 
with , 
V($)- ze (7.156) 


and A < 1, so that the potential is flat. At the Planck time, the uncertainty 
principle implies that V ($) can only be measured with an accuracy of mi. Thus, 
instead of $ being fixed at the minimum of V ($) at $ = 0 in all regions of space, 
we should expect @ to take values in the range 


mp mp 


— SOS m (7.157) 


in various regions of space (domains). These are the initial conditions for the 
domains. The evolution of $ fort > tp will permit a classical description, 
provided 

Vib) Sm> = yd“ S mj (7.158) 


in all domains. Since V(@) is, in general, non-zero, the various domains will 
undergo varying amounts of exponential expansion (inflation). 

Consider one such domain with an initial homogeneous field $ (tp). (As 
observed after (7.94), spatial dependence of $ is in any case damped out rapidly 
by the exponential growth of R(t).) For the potential (7.156), the Hubble constant 
of (7.41) is 


2 1/2 
H= (Gx) pmp.. (7.159) 
Neglecting the ¢ term in (7.33) for slow roll 
x A2 
-— (=) mpo (7.160) 


so that 


à 1/2 
$ = b(tp) exp |- (=) m p(t — »| : (7.161) 
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The inflationary growth of the scale factor R(t) is now given by (7.49) as 


t 
n £O -f H dt. (7.162) 
RGP) Srp 


Then, substituting the solution (7.161) for $ into the expression (7.159) leads to 


RO mx o fo [oae 
In Rip) M (tp) ( a| (2) mp(t — tp)|]. (7.163) 


For t ^ tp, this may be approximated by 


1/2 
21A 
R(t) = Rapexp | (2) e - tp) | . (7.164) 
3m?, 


From (7.161), we see that the motion is slow roll over a period 


À -1/2 1 
T (=) mp (7.165) 
during which time we see from (7.164) that there are 
2 
t 
Ne = 2x (2) (7.166) 
mp 


e-folds of inflation. Thus, there are at least 64 e-folds of inflation provided 
O(tp) Z 3.2mp. (7.167) 
This value is in the rquired range (7.157) provided 
as 107. (7.168) 


A region of the universe with such a value of $ (tp) could, therefore, develop into 
a universe in which the horizon and flatness problems are solved, as required for 
the universe we occupy. 

The observed value of p/p puts a more stringent constraint on the size of A. 
We estimate àp/p from (7.112) with > given by (7.160), H given by (7.159) and 
$ (t*) given by (7.207) with p = 4. Then 


ô 24/64 
2 S EET UNO (7.169) 
m 
where, as in section 7.12, N.(@(t*)) is the number of e-folds of inflation occurring 
after cosmologically interesting scales leave the horizon. Thus, 


8p V. 3x4 
he (~) INO). (7.170) 
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For |5p/p| ~ 107? and N-((t*)) = 50, we have 
à x3 x107, (1.171) 


The spectral index discussed in section 7.12 has the value given by 
n(k) — 1 = —0.06 (7.172) 


7.41 Hybrid inflation 


In the siow-rol! inflation models discussed so far, the inflaton @ which is 
undergoing slow roll is also responsible for the vacuum energy density that drives 
the inflationary expansion of the universe. In ‘hybrid’ inflation [20], two (real) 
scalar fields $ and y are involved. The field @ undergoes slow roll but most of 
the vacuum energy is due to the presence of yy. When the value of $ drops below 
some critical value 4, the field y is destabilized and it rolls from a vacuum with 
positive energy density to the vacuum with zero energy density, so that inflation 
ends. 
The simple original model, due to Linde [20] has a potential of the form 


v= jue * Ay -M° + A gag? (7.173) 

so that 
V=Vo+ "e - yu + Ang? * ay (7.174) 

where 
Vo = Myt and = my = AyM?. (7.175) 


The field y; has an effective mass-squared 
Meg = A24? — mi. (7.176) 


For 9? > 42 = m3 /A2, the effective mass-squared is positive and the only 
minimum of the effective potential in the y direction is at y = 0. The parameters 
may be chosen [20] such that the curvature of the effective potential is much 
greater in the y-direction than in the $-direction, so that, initially, y rolls to 
y = 0 while $? remains larger than 22. After a period of slow roll, $2 eventually 
drops below $2. At that point, y starts to roll towards a true minimum of the 
effective potential which is at 


$-0 y = +M. (7.177) 
This marks the end of the inflationary period. In the slow-roll region, 
V() ~ Vo + 1m?9?. (7.178) 


The spectral index 221 


Thus, using (7.58), 
` 8x Vo 


2 


~ —— (7.179) 
mpm? 


if Vo is the dominant term. N, can be adjusted to be greater than 64. For example, 
for Vo = (106. GeV)*, we have Ne > 64 for 


m € 10'3 GeV. (7.180) 


The parameter 92 = m [i gives enough freedom to obtain the observed value 


for p/p. (In the previous example ġe ~ 3 x 10!7 GeV.) We refer the reader to 
the paper by Linde [20] and the book of Liddle and Lyth in the general references 
for this chapter, for more detail. 


7.12. The spectral index 


The dependence of the density perturbation àp/p on the scale k will eventually 
allow different inflationary models to be distinguished by observations. Let us 


define ; 
6 2 2,,* 
P(k) = (2) = r | LEW (1.181) 
p polt*(k)) 
where we have used (7.112). (P (k) is proportional to the ‘power spectrum’.) The 
spectral index n(k) is defined by 


din P(k) 
-l=——. 7.182 
n(k) -l dink ( ) 
If n(k) is a constant, this reduces to 
P(k) x k"! (7.183) 


so that n = | corresponds to a scale-independent spectrum. 
The spectral index may be evaluated using the slow-roll conditions. Slow- 
roll parameters €($) and n($) may be defined by 


V (6) 
ez 5M? ( voy (7.184) 
ane v"(@) 
= MÀ 7.185 
n(o) P Vb) ( ) 
where 5 
m=% o] (7.186) 
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The derivatives de/dink and dy/d Ink are required for the discussion of n(k). 

From (7.34), during slow roll, 

_ 3H 
V'($) 


Moreover, the right-hand side of (7.181) is to be evaluated, as discussed after 
(7.112), when 


dr = dé. (7.187) 


k= R(* ()) H (r* (k)). (7.188) 


During slow-roli inflation, the rate of change of H is small compared with the rate 
of change of R and so 
dink = Hdt. (7.189) 


Combining this with (7.187) gives 


d Lol. y (wi 
Tink = 3m May (7.190) 


Then, using (7.41) for H 2 


d — wd 
Ink |. MPV) d$ 


(7.191) 


a 


With the aid of this, we may now evaluate the required derivatives of the slow-roll 
parameters (7.184) and (7.185), with the result 


de 


—— _ Ie? 
Tnk” 2(en — 2«^) (7.192) 
and d 
n (| _ £2 
dink = 2en — E (1.193) 
where yray" 
¿= Mm OY @) (7.194) 


vo)? 
Returning to (7.181) and (7.182), we may first simplify P(k) using (7.34) 
and (7.41) to obtain 


1 V(@) 
Pk) = ——. 7.1 
= eric ri (7.195) 
Then differentiating with the aid of (7.191) gives [21] 
n(k) — 1 = —6€ + 29. (7.196) 


Being slow-roll parameters, e ($) and 7($) are small. Consequently, 


In(k) — 11 <1 (7.197) 
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and n(k) differs little from 1. The deviation of n(k) from 1 at values of k of 
interest (k^! between the present Hubble radius of 3000 Mpc and the smallest 
scale for large-scale structure observations of 1 Mpc) is determined by the slow- 
roll parameters e($) and n($) for the given model. The variation of n(k) with k 
is also easily calculated using (7.192) and (7.193) to be [22] 


d In n(k) 
dink 


The spectral index distinguishes models of inflation. Consider, for example, 
an inflationary potential of the form 


= 16en — 24€? — 2£?. (7.198) 


V($) = X9? (p > 0). (7.199) 
(The chaotic-inflation potential (7.156) is a special case with p — 4.) Then 
e$) - 3M? phó? and =n) = Mp p(p - 097? (7.200) 
so that 
n(k) - 1 = -Mip(pt24$7? (7.201) 


with $ to be evaluated at £ = t*(k) for scales k of interest. The key thing is the 
number of e-folds of inflation that occurs after cosmologically interesting scales 
leave the horizon. From (7.51), what we require is 


% VQ) 
Ne(b(t*)) = Mg? Í —— d (7.202) 
«e P gue) VO) 
where ġe corresponds to the end of slow roll. In the present model, 
V($) -1 
— = (7.203) 
ve ^? 
leading to 
$7 (t*) — 92 = 2Ne(H(t*)) PM. (7.204) 
Slow roll ends when e($) ~ 1 and, from (7.200), we see that this happens when 
$ = be ~ pMp. (7.205) 
Thus, 
G°") = p! M$ + 2Ne(G(t")) pM5. (7.206) 


For modest values of p and large values of N&($(t*)), 


$(*) x J2N-(P(t*)) pMp (7.207) 


so that from (7.201) A 
+p 
n(k) — 1 œ ——————. (7.208) 
( 2Ne(G(t*)) 
For example [23], if Ne(@(t*)) = 50, we have 
2+p 
k)-1z--——— 7.209 
n(k)-1 100 ( ) 


and n(k) at observable scales differs from 1 by a few percent. 
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7.13 Exercises 


1. Obtain an alternative derivation of the equation of motion (7.33) for @ from 
the energy-momentum conservation equation. 

2. Generalize the equation of motion for the inflation field $(t, x) to include 
spatial variation (7.94). 

3. Redo the calculations of section 7.8 for the potential 


V=V-ag?+Ag* — wiho,A» 0 


4. Derive the sufficient conditions (7.142) to (7.144) for the slow roll for a 
complex inflation field. 


7.14 General references 


The books and review articles that we have found most useful in preparing this 
chapter are: 


e Kolb E W and Turner M S 1990 The Early Universe (Reading, MA: 
Addison-Wesley) 

e Liddle A R and Lyth D H 2000 Cosmological Inflation and Large-Scale 
Structure (Cambridge: Cambridge University Press) 

e Olive K A 1990 Inflation Phys. Rep. 190 307 
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Chapter 8 


Inflation in supergravity 


8.1 Introduction 


In order for new inflation (slow-roll inflation) to produce sufficient e-folds of 
inflation, it is necessary for the effective potential for the inflaton to be very 
flat near the point where slow roll begins. In general, even if the tree-level 
potential has this property, this flatness may be lost once radiative corrections 
are included. For this reason, it is advantageous in constructing models of 
inflation to employ a supersymmetric theory where cancellations of radiative 
corrections are enforced by supersymmetry. If we want a supersymmetric theory 
that contains gravity, the natural approach, as discussed in section 2.8, is to 
construct a locally supersymmetric theory (supergravity). Once a superpotential 
and a Kahler potential have been chosen for the inflaton field, the Lagrangian 
terms and the effective potential for the inflaton follow. 

As we shall see, viable inflationary theories can be constructed with simple 
choices of superpotentials and Kahler potentials. The positive value of the 
potential required for inflation may arise either from the first term in (2.156), 
the F-term, or from the second term in (2.156), the D-term, in the case that the 
inflaton is coupled to fields charged under a U(1) gauge symmetry. Theories of 
chaotic inflation and hybrid inflation may also be constructed. 

There are two potential problems arising from supergravity models of 
inflation which will be addressed in section 8.5 and section 8.6. The first 
is that gravitons (whose density was rendered negligible by inflation) may be 
produced by reheating after inflation. It is necessary to arrange that the reheating 
temperature is such that the gravitons do not have a serious effect on the 
abundances of deuterium and of He relative to “He abundance. 

The second problem (the so-called ‘Polonyi’ problem) results from the 
presence in supergravity theories of scalar fields with only gravitational strength 
interactions, which release the energy stored in their expectation values at very 
late times. This can lead to negligibly low helium and deuterium abundances at 


226 DOI: 10.1201/9780367806637-8 


Models of supergravity inflation 227 


temperatures too low for the necessary abundances to be recreated. It can also 
lead to too low a baryon number density. 


8.2 Models of supergravity inflation 


The key thing we require to study inflation is the effective potential for the inflaton 
field, which we assume to be a gauge-singlet scalar field $. If we also assume 
minimal kinetic terms for @ arising from (2.151), then the effective potential is of 
the form 


2 
" aw 
V 2e? *[|— 4. g*w| —3)|w/? (8.1) 
ap 
as in (2.152), in units where M p of (2.145) is one. In general [1], we may consider 
a superpotential which is a power series in , 


oo 
Wid) =u? Y Ang” (8.2) 
n=0 


where, as usual, we are not distinguishing notationally between the chiral 
superfield and the scalar field in that supermultiplet. When the expectation value 
of $ is real, the explicit effective potential corresponding to the superpotential 
(8.2) is (exercise 1) 


2 
V = u*e? [42 — 349 + 4A1(A2 — À9)ó 
+ (AR — A? 412 — 21239 + 6133)? + 20139 + 6A2A3 + 4104)? 
Od HAZ 935 22339 + ZArA3 + 2AOAG 161234 + 10A1A5)6" 
Tee] (8.3) 


A particularly simple case [2] is to take W ($) quadratic in $: 
W($) = uo + Aid + 329?). (8.4) 


The form of W(@) is further restricted by the requirement of the existence of 
a supersymmetry-preserving minimum of V($) with V = 0 for the following 
reasons. We have seen in section 7.7 that the energy scale of the inflationary 
potential is expected to be of order 1016-10? GeV. After inflation has occurred, 
$ rolls to a minimum of this potential. This minimum should have V = 0 
because otherwise there would be a vacuum energy on the 10'©-10!7 GeV scale 
which could not be cancelled by later supersymmetry breaking on the electroweak 
scale. There would then be a large cosmological constant. This minimum 
should be a supersymmetry-preserving minimum because otherwise there would 
be supersymmetry breaking on a scale too large for the hierarchy problem to 
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be solved. Let this minimum be at @ = c. From (2.158), supersymmetry 
conservation requires 


aw 
— W =0. 8.5 
36 té (8.5) 
From (8.1), V = 0 requires, in addition, that W = 0. Thus, we require that 
E = W=0 at$ =o. (8.6) 
In general, we require 
oo lo e] 
Jano" =0 and — mo"! =0. (8.7) 
n=0 n=0 


In the special case (8.4), these lead to 


Z _ AL 232 
7A and — 4AgA2 = A2. (8.8) 
Then, 
Wep) = Kile — o. (8.9) 


Then, from (8.1) the effective potential corresponding to (8.9) is (exercise 2) 
V = ef ungi — 4095 + (60? + 1)9* — 403g? + 
+ (a^ — 60? --4)9? + 80 (0? — 1)6 +.07(4 — 307)]. — (8.10) 
Assuming that slow roll occurs from close to the origin, we need 
V'(0) — 0. (8.11) 


Then 
o(a? — 1) 20. (8.12) 


For the minimum not to be at the origin (since $ must roll from a maximum), we 
do not want o = 0. Thus, o. must be +1 and we take 


o=1 (8.13) 
without loss of generality. Then (8.9) becomes 
W() = pilo - 1)? (8.14) 


(still in units where Mp = 1) and (8.10) is 


VAH) =e” u*32(1 — 9? — 49? +764 — 495 + 465). (8.15) 
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Expanding e? in powers of ¢? for the purpose of studying the slow-roll region 
close to the origin, 


V (4) = PR — 49? + Bot — 89) + Fee 4... (8.16) 
It follows that 
Vo = V(0) = 4132. (8.17) 


Then, the Hubble parameter H relevant to slow roll is given by (7.41) and, in units 
with Mp = l, 
H? ~ Bẹ = tvo = 443 (8.18) 


recalling that m?, = 8 MÀ. Also, in units with Mp = 1, the slow-roll condition 
(7.45) is 


IV" ($) 
8.19 
Vip) (8.19) 
and the slow-roll condition (7.46) is 
V'($) ) 
lI—— 6. 8.20 
( V) « (8.20) 


With V ($) given by (8.16) and with $ close to zero, (8.19) requires that $ is in 
the range 
M 
OSS =e (8.21) 

(8.20) is automatically satisfied whenever V"($) ~ V'($)/$ and || is at least 
an order of magnitude less than m p. We certainly satisfy the latter requirement 
because $, = 4M p and, for small $, the former condition is also satisfied. 

To avoid de Sitter fluctuations driving ¢ across the flat region too rapidly 
(faster than it would roll semi-classically) we need a width Ad = ¢, for the flat 
region with the property (7.91). With the Hubble constant given by (8.18), then 


2 
urns < —_. (8.22) 


If we are able to arrange that N, ~ 64, then this requires that 
py? |A2| < 0.17 (8.23) 


or, equivalently, 
| Ho| < 0.098. (8.24) 


Turning next to the number of e-folds of inflation, (7.57) requires that 


3H? 
Ne m ——, 8.25 
|V"(6i)| (8.25) 
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To allow for quantum fluctations V’(¢;) should be evaluated at $; ~ Hy rather 
than dj ~ 0. (See, for example, [3].) A rough estimate may be made by keeping 
only the term linear in ¢ in V'($). Then, 


{Hol i 
~ — ~ ———. (8.26) 
* 8492 83u? 
N, ~ 64 is achieved for 
2 ~ -3 
u hal 2 1.13 x 1077. (8.27) 


This value of p2|ral satisfies with ease the bound (8.23) to avoid de Sitter 
fluctuations driving $ across the flat region too rapidly. 
Using (7.112) and (7.113), density fluctuations of the order required by the 


COBE observations imply that 
2,,* 
-B0 2x10 (8.28) 
x3 2óo(1*) 


(see [3). We certainly have t* < te, the time at which slow roll ends. From 
(7.34), 


$--——. (8.29) 
If we make a rough estimate of V'($) from the term quadratic in $, then 
V' (be) = -12u*3292. (8.30) 
If we also approximate H (te) by Ho of (8.18), 
$9 = 4435212402. (8.31) 
With these approximations, 


H(t) u^ 


———M—— X ———————. (8.32) 
mlte) —— 124302112192 
With t ~ t, and ġe ^ 1/8, the value of pe |A3| consistent with the density 
fluctuations (8.28) is 
u^ |A2| x 3.6 x 1075. (8.33) 


This value of 1?|A2| is sufficiently small that we get from (8.26) more than 64 
e-folds of inflation with ease. The condition (8.22) to avoid de Sitter fluctuations 
driving $ too rapidly across the flat region is also satisfied with ease. For smaller 
values of t*, and so of $(t*), pedal is even smaller. 

To decide whether sufficient inflation occurs in practice, it is also necessary 
to consider the initial conditions, because sufficient inflation depends on ¢ starting 
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rolling across the flat region between $ = 0 and $ = 4, from a value of $ 
sufficiently close to @ = 0. However, if initially the universe was in thermal 
equilibrium, then thermal effects may put ¢ well into the region @ > O and prevent 
enough inflation. This is referred to as the ‘thermal constraint’ [4]. In general, 
for minimal kinetic terms the zero-temperature effective potential V derived from 
(2.147) is 

V = e€(G;G! — 3) (8.34) 


and from (2.162), the finite-temperature correction vi to the effective potential 
is given by 


- N ; 
VI = constant + gr eG — 2) (8.35) 


in the limit of a large number N of chiral fields, which is a reasonable 
approximation in practice. If the finite-temperature effects are not to destroy the 
flatness of the potential, we must require that 


aV 
02 — t$ — 0. 8.36 
EP at $ (8.36) 
For a single gauge-singlet real scalar field ¢ with minimal kinetic terms, so that 
G; = G' = G'(9), these require that 


avy 
ad — 


G'(0) =0 (8.37) 


so that 
V(0) < 0. (8.38) 


It is then impossible for ¢ to roll to a (supersymmetry-preserving) minimum with 
V = 0. Thus, the thermal constraint is a very powerful constraint. It may 
sometimes be evaded if the inflaton has non-minimal kinetic terms. 

However, if a (weakly-coupled) inflaton field $ is out of thermal equilibrium 
for temperatures below the Planck scale, then the initial value of $ will, in general, 
have a broad distribution [5]. Consequently, it is unlikely that a randomly chosen 
horizon volume will possess a (smoothed-out) value of ¢ close enough to ġ = 0 
for much inflation to occur. However, any horizon volume which does have a 
value of $ close to $ = O will undergo inflation and, after inflation has occurred, 
most of space will be occupied by such regions. As a result, we are very likely to 
find ourselves in a region of space which derived from such a horizon volume at 
early times. For this reason, we shall not consider ourselves bound by the thermal 
constraint. 

We consider next reheating in the context of this simple supergravity model. 
For a gauge-singlet inflaton field with only gravitational strength couplings, we 
expect a decay rate 


To ~ —. (8.39) 
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Also, from the double derivative of the effective potential, a mass-squared m; of 
the inflaton of order 42 
2 BA 
mo~ 
P 
is to be expected, allowing for (6) ~ Mp owing to the effects of quantum gravity 
and restoring factors of Mp. Thus, 


(8.40) 


613 
BÀ 
Ds —À. (8.41) 
Mp 
The reheating temperature of (7.72) is 
343/2 
Ww 
Tg ~ (To Mp)!? ~ —2- (8.42) 
Mp 


ignoring the difference between m p and Mp. Using the estimate (8.33), this gives 


Tg ~ 10!! GeV. (8.43) 


8.3 D-term supergravity inflation 


To arrange for sufficient inflation in the model of the previous section, it was 
necessary to take u^ 31 ~ 1073, which is an unnatural fine-tuning of the 
superpotential. This is a generic feature of supergravity models where the positive 
value of the effective potential during inflation is due to a non-zero F-term in 
the sense that p + pw? Æ 0 in (2.156). (The terminology is because 
8W/84; + '* W is the generalization to the supergravity context of the auxiliary 
field usually denoted by F’ in the construction of the globally-supersymmetric 
Lagrangian.) When all relevant fields are gauge singlet and assuming minimal 
kinetic terms, the effective potential for the inflaton takes the form (8.1). There is 
a term quadratic in $, namely Vod*, where Vo = V (0). Keeping only this term 
and assuming a real inflaton, 


v"(¢) ~ Vo (8.44) 
and then, from (8.25), the number of e-folds of inflation is 
e~ sl (8.45) 
But when $ — ¢o = 0, 
R? ~ 1 Vo. (8.46) 
Thus, 


N, 7 1. (8.47) 
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To obtain sufficient e-folds of inflation generally requires some fine-tuning of the 
parameters of the superpotential, so that other quadratic terms can cancel the one 
displayed. In the model just discussed in section 8.2, the choice of superpotential 
is such that a quadratic term in V($) does not occur. Nevertheless, there is a 
similar problem because sufficient inflation required the parameter 42|A2| in the 
superpotential to be € 1073. 

This generic difficulty for F-term inflation can be avoided if the positive 
value of V required for inflation originates from the second term of (2.156) 
(referred to as the D-term because it generalizes the contribution of the auxiliary 
field denoted by D in the context of global supersymmetry). Because there is no 
factor of e?' in the D-term, the previous argument does not apply and D-term 
inflation [6] does not suffer from the generic problem discussed above. 

A simple example [6] is provided by the superpotential 


W =Aods¢- (8.48) 


where ¢ is the inflaton field and $4 are two other scalar fields with charges +1 
under a U (1) gauge symmetry. Let the Kahler potential K be chosen to have the 
minimal form, as in (2.151), and let the gauge kinetic function be minimal, as in 
(2.154). Then 

G — 6*6 + O4.64 + ó*4- + In|wp? (8.49) 


and (exercise 3), using (2.156), the effective potential is 


y = ef Hoe HOF yn, o. Pa + 161?) + po- CL + 141?) 
+ 16940 19-0] + 48o — 19.1? E). (8.50) 


A constant £ (the Fayet—Iliopoulos) term has been included, which can be present 
for a U(1) gauge symmetry. We assume that £ > 0. It may be checked 
(exercise 4) that (¢,,@—) = (0,0) is a minimum in the ($4, @_) space when 
the inflaton field @ satisfies 


ne sv = de. (8.51) 
For these minima, we see that 
V = dg? (8.52) 


Thus, the potential is then flat so far as the inflaton is concerned (and has a large 
positive curvature in the $4. and @_ directions). 

In a chaotic inflation scenario, we may assume the initial condition |$| > ġe. 
Then inflation will occur. The amount of inflation will be very large because the 
only lack of flatness in the effective potential, so far as the inflaton is concerned, 
is due to radiative corrections. Note that the positive value of V driving inflation 
is essentially due to the D-term. 
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8.4 Hybrid inflation in supergravity 


The hybrid inflation idea discussed in section 7.11 may be extended to the context 
of supergravity. A simple superpotential which allows hybrid inflation to be 
implemented [7] is 

W = óQua Va — i?) (8.53) 


where y, and v» are a pair of (chiral) superfields in non-trivial conjugate 
representations of some non-Abelian gauge group and @ is a superfield neutral 
under any gauge group. (We use the same notation for superfields and the scalar 
field associated with them.) Assuming minimal kinetic terms, as in (2.151) and 
(2.156), the corresponding effective potential apart from the last (D-) terms in 
(2.156) deriving from the non-trivial gauge properties of yı and y, is as follows 


y ze (Fpl? + [Fo P. + Fp? — 3112) (8.54) 
where 
OW |. 2 2 

Fy mu, OW = (1+ OM Aviv - p°) (8.55) 

oW 
Fy = ag, + TW = 2001 la y» — wid (8.56) 

aw 
Fin = gg; YW =A + Wry — nwo. (8.57) 


If y, and yr? roll rapidly to zero, then the effective potential for $ is 
V=e ua - e? +94) (8.58) 


since $ is a real scalar field being neutral under any gauge group. Expanding in 
powers of $2, 
V œ p(t 19^. (8.59) 


The cancellation of the quadratic term in $ evades the generic problem with F- 
term inflation discussed in the previous section. 

It may be seen by returning to the globally supersymmetric theory that it is 
indeed reasonable to take yı and y? fixed to zero. The globally supersymmetric 
effective potential is (following (2.120)) 


v= [pur ene jul (8.60) 
ME" ay ayz : 
= Ayip — KP +767 WP Wal?) (8.61) 


(apart from the D-terms for the gauge non-singlets y; and yr? and remembering 
that is neutral.) The absolute minimum of the effective potential is at 


$ — 0, yi-y- (8.62) 


Ls 
"S 
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However, if 


p> Esg (8.63) 


^ 
the fields yı and yz have positive effective squared masses and are confined to 
Vi = y2 = 0. The effective masses of y, and y2 are contained in the terms 


—-u^X Qna + VET) HAP nv. vaya). (8.64) 
Writing 


l 
—=(Aı t iBi) and 25 


y = L + iB2) (8.65) 
J2 J2 
the effective mass terms are 
3429? (A1 + B? + A3 + B2) — w?A(A1 A2 — Bi B2) (8.66) 


and the mass-squared eigenvalues are (A29? + 42)/2, both of which are positive 
when $ > d. 

Now the model for inflation reduces to one with a single real scalar inflaton 
with potential (8.59). The slow-roll conditions (7.45) and (7.46), in units where 
the reduced Planck mass M p is 1, are satisfied when 


IV” ($) V'($) V? 
3 d ——— 6. 8.67 
V@) < an (Te) < ( ) 


With the above potential, these give $? « 1/2 and $? « 1.15 respectively. Thus, 
the slow-roll region is 
<5. (8.68) 


To calculate the number of e-folds of inflation, it is necessary to consider 
the time dependence of $ during slow roll. In units where Mp = 1, (7.41) is 
= V/3 and in the slow-roll region V ($) ~ u^, so that 


2 
H~ E. (8.69) 
3 
Then (7.34) is 
; 2 243 
¢=-— (8.70) 
JA" $ 
which shows that $ is decreasing with t ford > 0. In Mp = I units, (7.51) gives 
ory 
Ne = -f VO) ag (8.71) 
e V$) 


= 4(67°- 77) (8.72) 
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so, for f « $i, we have 
Ne = 407°. (8.73) 


Thus, what sets the limit on the number of e-folds of inflation is how small $; 
can get. There is no limit set by slow roll because we have slow roll whenever 
$? < 1/2. However, there are radiative corrections to the effective potential that 
we should now take into account. At one-loop order, they are of the form 


wut ¢ 
V = ——-ln— 8.74 
1-loop = g42 ^ P (8.74) 
so that 2.4 
' _ AU? i 


This is of the same order as V'($) from the supergravity potential (8.59) (without 
radiative corrections) when 
À 
uad. 8.76 

$ in (8.76) 
For smaller values of $, the contribution to V'(ó) in the slow-roll equation due to 
radiative corrections is larger than V'($) from supergravity at tree level. Thus, we 
must truncate the contribution to N, from rolling in the uncorrected supergravity 


potential at 
/ À 


Then the contribution to N, from the period of slow roll before radiative 
corrections become important is 


Ne xni !. (8.78) 


This gives at least 64 e-folds of inflation (even without including any further slow 
rolling when radiative corrections have become important) for 


à € 0.05. (8.79) 


Recalling that Mi = m? /87, (7.114) and (8.59) lead to 


bp pw 


P X UPPS (8.80) 


in units where Mp = 1. If we estimate * as the time at which we can no longer 
neglect radiative corrections, then from (8.76) 


y= P 8.81 
Oo") = an (8.81) 
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and 


áp u? 


n ~ BR anys (8.82) 


The requirement that p/p ~ 2 x 107? fixes u in units of Mp once A has been 
chosen. For 4 chosen as in (8.79), we have 


u S 0.05Mp. (8.83) 


8.5 Thermal production of gravitinos by reheating 


The thermal production of gravitinos in the early universe can cause problems, as 
discussed in section 6.3. At first sight, a possible solution to these problems is for 
the gravitino density to be diluted by inflation. However, a gravitino density can 
be produced by reheating after inflation and it is necessary for this density to be 
low enough that the problem is not recreated. 

Gravitinos produced by reheating after inflation can have a serious effect on 
the abundances of deuterium (D) and ?He relative to the “He abundance. The 
problem is that D and ?He can be produced by photofission from *He by radiation 
from gravitino decay. These relative abundances are known to be very small and 
so we must avoid gravitino densities sufficiently large to violate these bounds. 

The gravitino density n3/2 produced during reheating by 2 — 2 scattering 
processes involving gauge bosons and gauginos has been estimated [8] to be given 
by 


n3 -13 TR 
— 2x1 —— _ 8.84 
a, 2x19 (s m) (8.84) 


where Tp is the reheating temperature. However, an estimate of the amount of D 
and ?He produced by photofission from ^He requires that 
232 643x107? GeV. (8.85) 


3 
m 
ny 


Thus, there is a bound on the reheating temperature: 


10 2 
< 1.5 x 107 (GeV) 


^ 


Tg . (8.86) 


my 


For example, for m3/2 = 100 GeV, Tg < 1.5 x 105 GeV. Subsequent calculations 
[9] have shown that the bound is less stringent than this formula suggests for 
larger values of m3/2, e.g. for m3/2 = 1 TeV, Tr < 2 x 10? GeV. There is also 
the danger of excessive gravitino production by decay of the inflaton. This is a 
very model-dependent matter but sufficient suppression can occur in particular 
models [10]. 
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8.6 The Polonyi problem 


This generic problem results from the presence in a theory of a light scalar field 
which has only gravitational strength interactions, with the consequence that it 
is decoupled during most of the history of the universe and eventually releases 
energy stored in its expectation value at a very late time [11]. This release 
of energy increases the entropy of the universe at a low temperature thereby 
producing a negligible baryon abundance and, worse still, negligible helium and 
deuterium abundances. The temperature may then be too low for the required 
abundances to be recreated. 

A simple example of this problem, from which it derives its name, occurs in 
the context of the Polonyi model for supersymmetry breaking in supergravity. We 
describe first this mechanism for supersymmetry breaking. The Polonyi model 
is an example of a model in which supersymmetry breaking occurs in a ‘hidden 
sector’, by which is meant a sector of the theory which couples to the ‘observable 
sector’ of quarks, leptons, gauge fields, Higgs scalars, and their supersymmetric 
partners only through gravitational interactions. The hidden sector of the Polonyi 
model employs a single gauge-singlet scalar field $ (not the inflaton) and its 
supersymmetric fermionic partner with superpotential 


W($) = KÈ + p). (8.87) 


(We are using the same notation for the chiral superfield and its scalar field 
component.) Minimal kinetic terms are chosen so that G of (2.144) has the form 


G = ó*ó + In W|?. (8.88) 
In (8.87), & is a real parameter with dimensions of mass and 
B-2-43 (8.89) 


in units where Mp = 1. The parameter B has been fixed to this value so that the 
effective potential of (2.147) 


V = Atesi tote + BP? — 316 + BP) (8.90) 


has its absolute minimum at 


@=V3-1 (8.91) 
with V = 0 and, so, the desirable feature of a vanishing cosmological constant in 
the physical vacuum. At this minimum, 


OW grw = Jag? £0. (8.92) 


Consequently, supersymmetry is broken, as discussed after (2.152). 
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If the field $ starts (because of quantum fluctuations) at some value 
of $ which differs from the minimum, then the energy density stored in 
the expectation value of $ is of order jz‘. The size of ji is related to 
the size of supersymmetry breaking effects. In supergravity theories with 
supersymmetry breaking in a hidden sector, the size of supersymmetry breaking 
effects transmitted gravitationally to the observable sector is on the scale of the 
gravitino mass m3/2 and, for supersymmetry to solve the hierarchy problem, m3/2 
should be about 100 GeV to 10 TeV. In the Polonyi model, the gravitino mass is 


52 
m32 = se, (8.93) 
P 


Thus, for m3/2 in the range 100 GeV to 10 TeV, we have 
10'° GeV < ñ < 10!! GeV. (8.94) 


(For a discussion of the gravitino mass in supergravity theories with hidden-sector 
supersymmetry breaking see, for example, [12].) 

For the discussion of the entropy increase of the universe when the Polonyi 
field vacuum energy decays, we shall need to know the expectation value of 
the Polonyi field. In the context of cosmological inflation, we should determine 
this expectation value by minimizing the total effective potential of the Polonyi 
field and the inflaton. (There may also be effects of quantum fluctuations.) The 
superpotential of the Polonyi field ¢ is as in (8.87) and, for definiteness, we may 
take the superpotential for the inflaton field ¢ to be 


W($) = Walo — o (8.95) 
as in (8.9). Thus, the total superpotential is 
Wilh, 6) = Wb) + W(9). (8.96) 
We also assume minimal kinetic terms so that 
G = $*$ + $*$ + In|Wrotl?. (8.97) 


Then the effective potential can be calculated from (2.147) in units where the 
reduced Planck mass Mp = 1. Taking $ to be real, working to quadratic order in 
$ (when $ « 1 in the same units), and remembering that $ ~ 0 during slow roll, 
the minimum of the effective potential may be estimated to be (exercise 5) 


2u? A20? 


$ ~ —————————ÁÀ. 8.98 

? usr — 4Bu? iio? 898) 
The values of the parameters of the Polonyi model are given by (8.89) and (8.94), 
and the parameters of the superstring model of inflation that we are employing by 


(8.13) and (8.33). Using these values, @ may be estimated to be 
$ = 3(1073-1074). (8.99) 
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Table 8.1. Event sequence when the inflaton vacuum energy decays before the Polonyi 
field oscillates. 


Time (t) pelt) g(t) Prad(t) 
tp «t «tp r$! mg? ~=pglts) 0 
t=tp~ rj! pe (tp) " Prad(tD) 
2,,-1/2 
Tg ^ mi Mp / 
Ip «tet m 0 xps f) ~ 
fj <t<ip~T3? ~T? -T* 
t=ip~ rj pg p) —. radiation 
= 3/2 ,,-1/2 
Tr "n Mp 


We shall assume in what follows that, in reduced Planck-scale units, 
$-f (8.100) 


where f is not many orders of magnitude less than unity. Considering the effects 
of quantum fluctuations will also lead to this conclusion. 

To proceed further, we need to specify the temperature at which the inflaton 
vacuum energy decays. The crucial thing is whether this is before or after the 
Polonyi field starts to oscillate [13, 14). 


8.6.1 Inflaton decays before Polonyi field oscillation 


Let us consider first the case when the inflaton has already decayed and reheated 
the universe before the Polonyi field starts to oscillate. Then, the sequence of 
events is summarized in table 8.1. Inflation ends at t = tp and, at that time in the 
present modei, the Polonyi field vacuum energy density is 


p (tp) = m9? (8.101) 


provided that @ is significantly less than 1 in reduced Planck-scale units. Also, 
the inflaton vacuum energy density is 


pe (tp) = ptr (8.102) 


from (8.17), because $ does not roll much during inflation. The value of the 
Polonyi mass-squared is determined by 


8? Vere 
2. t _ 32-2 
$7 ya -p'ü (8.103) 


m 
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where, from (8.93), 
ji~ my (8.104) 


in the present units and £ is given by (8.89). 
For B<t< fp, where the Polonyi field starts to oscillate at £ = t$ when 
the temperature is Tz, and its vacuum energy density decays at fp, 


NEC RN LC (8.105) 


is the ratio of the Polonyi field vacuum energy density to the radiation energy 
density praa. Since 


- Le 4 
Prad (tg) T; (8.106) 
with the approximation (8.101), we have 
pg) meg? 
Praa (t) 2737 


(8.107) 


Further progress requires a calculation of T5. 

Between t = tp, the time at which the inflaton vacuum energy density 
decays, when the temperature is Tp, and t = tj, the universe is radiation 
dominated. Thus, in reduced Planck-scale units, 


jV fÈ 

T] VR 
where we have used the fact that RT is constant whenever the number of particle 
species is constant, for conservation of entropy. This equation has the solution 


2? i 1 TM 
= 3 Prad(T) = 3 Pred (To) (z.) (8.108) 


_ Prad(TD) 
6T5T? ` 


(8.109) 


Att = t Prada (Tp) is one or two orders of magnitude larger than TA, using 
(2.22) with Ng + iNF = 2 for the supersymmetric standard model or 2 for 


the standard model respectively. Also, since t$ ~ mi! , we have 


O2 
T; mz > (8.110) 


(8.111) 
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If the Polonyi field were not to decay, then at T = 2.7K > 107?! Mp, we 
would have (D 
P3 - 
A atu. (8.112) 
Prad(t) $ 
If we want to avoid Dg ®)/ Praa (t) > 1, so that the ó energy density does not 
dominate the energy density of the universe and produce too large an expansion 


rate, we need - 4 
$ € (1075 — 10715m;/ . (8.113) 


With m é given by (8.103) and (8.93) and for 


m3/2 = 107!5-107'® (8.114) 
in reduced Planck-scale units, we then require 
$ € 1079-197 ^, (8.115) 


Comparing this with (8.100), this is an unnaturally small value by many orders 
of magnitude. Making (8.106) more precise only increases this by an order of 
magnitude. 

In practice, the Polonyi vacuum energy decays but the previous discussion 
suggests that there may be a problem with entropy production when it does decay. 
The decay occurs at temperature Tp with 


tipo T; (8.116) 
with 1 ; 
ryt ~ mi (8.117) 
and then, from (8.111), 
- 1/272 
pto) |"; $ 
L— v= (8.118) 
Prad (Ep) Tp 


We now require an estimate of the Polonyi field decay temperature Tp. 
Remembering that, between t = t$ and t = ip, the vacuum energy density 
for $ behaves like a gas of free non-relativistic particles behaving as in (7.64), 
P3 (t) is growing relative to Prag(t). Consequently, we may expect pg to dominate 
the energy density of the universe by t = fp. We therefore approximate the 
time dependence of the temperature by a universe dominated by the Polonyi field 
energy density Dj. (Recall that the inflaton vacuum energy density has already 
decayed and been converted to radiation.) Then we have to solve 


2 


Qe) e 
T] "VR = 32s 09) T; (8. ) 
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~ (2 2)" TD (8.120) 


with the result that 


$9 
Att = fp, and with fp given by (8.116), we get 


Tp ~ m2 Hn Ps). T; (8.121) 
Using (8.111), and with 5 (tg) = pg (tf) given by (8.101), we have 
Tp ~ mg, (8.122) 
Substituting this into (8.118) gives 


Ds (tp) an- 
$UD) 74035873, 
Prad(tp) $ 


Consequently, the ¢ vacuum energy density dominates the density of the universe 
at the moment of decay provided 


(8.123) 


$> my. (8.124) 


With mg given by (8.103), this condition is 
¢ > 1079 (8.125) 


in reduced Planck-scale units. With the estimate (8.100) of $, (8.125) is satisfied 
with ease. 

Provided that p; does dominate the energy density of the universe at the 
moment of decay, there is a further reheating of the universe (in addition to the 
reheating that occurred when the inflaton decayed) to a temperature 


Tr ~ mY* (8.126) 
where r$ is given by (8.117). For successful nucleosynthesis, we must have 


Tg > 1 MeV = 107?! Mp. (8.127) 
This requires 
ms 1074 (8.128) 


in reduced Planck-scale units. The value of m å used here (~ 1078) satisfies this 
bound with ease. There is then an entropy increase of 


Tp 


~ 3 
A= (2) ~ m; ~ 10892. (8.129) 
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Table 8.2. Event sequence when oscillations of the Polonyi field begin before the inflaton 
vacuum energy density has decayed. 


Time (t) pelt) g(t) Prad(t) 
~- ^ z! ^vi -2 ^n- 
tp<t< t3 ms t xps (tf) 0 
fj «ttp D ww wx 0 
t=tp ~r} > Prad(t) 
Te ~ mi^ uz! 

tp <t<ip~ rj 0 ^T) ~T4 
t=ip~ rj B Gp) — radiation 

P -1 

Tr ~ m M; n 


Whenever ¢ is not very much greater than 1 in reduced Planck-scale units, this 
is large and may dilute the baryon number density of the universe unacceptably. 
In that case, a reheat temperature Tr large enough to recreate the required baryon 
number density is needed. For low-temperature baryogenesis, 


Tg > 100 GeV ~ 107! 5M, (8.130) 
is required and, using (8.126), this imposes the bound 
ms 2, 10719-1971, (8.131) 


In the model being studied here, m a~ 1078 using (8.103), (8.104) and (8.89) and 
so it should be possible to regenerate the baryon number by a low-temperature 
mechanism. However, for other types of light fields with only gravitational 
strength interactions, their masses may be too small for the entropy generation 
problem to be solved in this way (and the entropy generation may also be larger). 


8.6.2 Inflaton decays after Polonyi field oscillation 


We consider next the alternative possibility [13, 14] that the inflaton vacuum 
density does not decay until after the Polonyi field has already started to oscillate. 
This might result in too low a reheating temperature to regenerate the baryon 
number of the universe except with a low-temperature baryogenesis mechanism. 
The sequence of events is summarized in table 8.2. Between the end of inflation 
at t = ty and the start of Polonyi field oscillations at t = tj; the inflaton vacuum 


energy density po (t) decreases as 17? and the Polonyi field vacuum energy density 
pg is essentially constant. Between t = 15 and t = rp, the time at which the 


inflaton vacuum energy density decays, both t = pọ and t = P3 decrease as t^?. 
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Att = t5, the inflaton vacuum energy has not yet decayed and this vacuum energy 
density is an effective matter density controlling the expansion of the universe, as 
discussed after (7.63). At this time, we expect 


^g 
Ht 3 (8.132) 
or one or two orders of magnitude greater, and we also have 
po = 3H? (8.133) 


in reduced Planck-scale unit, from (7.41). Thus, at t = tj 


pp ~ 3m3. (8.134) 
Also, at t = tj: 
;c img? 8.135 
P = m9 (8.135) 
where we can approximate $ by its value at t = ty because, to a good 


approximation, ó does not start to oscillate until t = tz. Consequently, because 
po and 0$ have the same time dependence between t — t$ and: = tp, 


pg (tp) | Da (t3) 15 
pelto) — Pelt) 6 ` 


(8.136) 


At t = tp, the inflaton vacuum energy density decays and so immediately 


afterwards (tp) 
Pptp l-2 
~ =". (8.137) 
Prad(tp) 6 
For f < tp but greater than fp, the time at which the Polonyi field energy decays, 


P3 decreases as T), whereas the radiation density decreases as T*. Thus, for 


tp «t «tp, - 
O50) — PS TR Q Tr 
Prad(t) — Prad(tg)T 6T 


where 7g is the temperature to which the universe reheats when the inflaton 
vacuum energy decays. 

First, consider what would happen if the Polonyi field $ were not to decay. 
For nucleosynthesis to be able to recreate the ^He and deuterium densities diluted 
by the increase in entropy due to inflaton decay, we must have Tg larger than 
1 MeV = 107?! Mp. Since 


(8.138) 


Tg ~ m3 My? (8.139) 
as a consequence of (7.72) with l'a ~ m3M>°, it follows that 


mg > 107^ Mp (8.140) 
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at T = 2.7K ~ 10! Mp, and then, from (8.138), that 
zt 42 
Prad(to) ~ 6 


To avoid P3 dominating the energy density of the universe and producing an 
excessive expansion rate, we need 


(8.141) 


¢ < 10-5 (8.142) 


in reduced Planck-scale units. Thus, there will be a problem if ó is not very small 
on the reduced Planck scale. 

This suggests that there might be a problem with entropy generation in the 
realistic case where $ decays before 2.7 K is reached. We must decide first 
whether or not the Polonyi vacuum energy density will dominate the energy 
density of the universe at the moment of decay. Att = fp, when the Polonyi 
field vacuum energy decays, from (8.138) 


pafo) — $iTg 
Pra (Ep) ~ 6Tp 


(8.143) 


Thus, we must next estimate the temperature Tp at which this decay occurs. The 
Polonyi field energy density pg grows relative to the radiation energy density as 
the temperature drops for tp < t < tp. We therefore approximate the time 
dependence of the temperature by taking the energy density to be dominated by 
På- Then, 


2.2 +2 3 
T R ] l T 
(5) = (3) = 3/3) = 3/6 (D) (7) (8.144) 
with solution » 
4T? 
T- $ 1725, (8.145) 
3pg (tb) 
Using (8.116), (8.139) and (8.101), we see that 
t$ M. mÉ 1 
- — (2%) 5.02) ~ oA ~ mo f2 
pg (tp) = ( 3 Dg t) m Palts) ~ Smot (8.146) 


where we have also used t~ ms! and tp ~ rj! ~ ms. Now, from (8.145) 
and (8.146), 


Tp ~ m," m3$ 75. (8.147) 
Returning to (8.118), 
0g (tb) 1-8/3,2 2 158/344/3, -2 
nodo a9 amg GP TR m (8.148) 
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The Polonyi vacuum energy density dominates the energy density of the universe 
when this is greater than one. The condition for this is that 


@ 2 ms m ~ Tg mt. (8.149) 


Assuming that baryon number regeneration has to occur at reheating after the 
inflaton vacuum energy has decayed to radiation, we require 


Tg 2 100 GeV (8.150) 
or, from (8.139), 
mg 2,107 !! Mp ~ 10’ GeV (8.151) 


for electroweak baryogenesis. Also, to regenerate the *He and deuterium densities 
after destruction of these nuclei by the decay products of the Polonyi field $, we 
require 
mg Z 10 TeV = 10 7 Mp (8.152) 
much as in (8.140) for the dilaton. 
If, for example, we take mg = 10 GeV and m; = 10 TeV, then for ó lin 
Planck-scale units, from (8.143), 


pD) — 1 
Prad(tp) 4 


x 10° (8.153) 


so that the $ vacuum energy density dominates the energy density of the universe. 
The increase in entropy of the universe in reheating after the @ vacuum energy 
density decays is 


~ \3 
Tr 22. 3/2. -3/2 
A~n =] -ó m, m- (8.154) 


where we have used (8.147) and Tg ~ m/ 2, For the same choices of mg and m à 
and $ ~ 1, (8.154) gives 
A~ 10° (8.155) 


which, though quite large, may not be inconsistent with a sufficiently large baryon 
number density surviving. 


If, however, we take m é ^ my2. as in the model being employed here, 
> 10° GeV, corresponding 


then with $ ^ 1, pg (tp) > pma(fp) when Tg > 
to mg 2 10-8 Mp, and the Polonyi vacuum energy density then dominates the 
energy density of the universe at the moment of decay. (For lower values of Tr 
the Polonyi vacuum energy density does not dominate.) In that case, A is only 
of order 1 for mg ~ 10-8Mp and no dangerous entropy generation need occur. 
Thus, the entropy generation problem is not present for moderate values of the 
parameters when the inflaton vacuum energy decays after the Polonyi field has 


started to oscillate. 
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8.7 Exercises 


1. Derive the effective potential of (8.3) from (8.1) using the superpotential 
(8.2). 

2. Derive the effective potential (8.10) for the superpotential (8.9) with minimal 
kinetic terms. 

3. Derive the effective potential (8.50) for a model of D-term inflation. 

4. Check that the D-term inflation model effective potential (8.50) has a 
minimum at $4. = $.. = 0 when the inequality (8.51) is satisfied. 

5. Estimate the minimum of the effective potential for the Polonyi field in the 
presence of the inflaton field. 


8.8 General references 


The books and review articles that we have found most useful in preparing this 
chapter are: 


e  Bailin D and Love A 1994 Supersymmetric Gauge Field Theory and String 
Theory (Bristol: IOP) 
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Chapter 9 


Superstring cosmology 


9.1 Introduction 


For energies small compared to the string scale (which is of the order of 
the Planck scale in weakly coupled heterotic string theories), heterotic string 
theory in ten dimensions reduces to four-dimensional supergravity theory once 
compactification of extra dimensions has taken place. Thus, the discussion 
in chapter 8 also applies to heterotic string theory with special choices of 
the superpotential and Kahler potential derived from specific string theories. 
However, there are other aspects of string theory cosmology that go beyond 
supergravity cosmology. 

Superstring theories contain massless (in the first instance) fields, referred 
to as the ‘dilaton’ and, more generally, ‘moduli’, whose effective potential 
is flat. These are a field theory manifestation of degeneracies of the string 
vacuum. In particular, if the compactification of the theory from ten dimensions 
to four dimensions occurs on a six-dimensional torus, certain of these moduli, 
the ‘T-moduli’, correspond to the freedom (before non-perturbative effects are 
considered) to vary continuously the radii of the torus along the associated 
axes. Similarly, the (expectation value of the) dilaton S is associated with 
the freedom to redefine the strength of the gravitational coupling. The dilaton 
and moduli fields are expected to obtain masses on the electroweak scale 
when supersymmetry breaking occurs and a non-trivial effective potential is 
generated. The supersymmetric partners of the dilaton and moduli (the ‘dilatino’ 
and ‘modulinos’) have a cosmology similar to the gravitino and can produce 
unwelcome densities in the universe. The dilaton and moduli fields themselves 
have a cosmology similar to the Polonyi field discussed in section 8.6 and can 
produce excessive entropy by late decay. These problems will be discussed 
in section 9.2 together with a possible solution through the thermal inflation 
mechanism. 

Another problem associated with the existence of the dilaton in particular is 
the need for it to settle into a minimum of the effective potential. If this occurs 
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only with difficulty, there can be adverse effects on inflation. This problem will 
be discussed in section 9.3. On a slightly more positive note, the dilaton and 
moduli fields with their flat potentials before supersymmetry breaking are possible 
candidates for inflaton fields and this will be discussed in section 9.4. 

The discussion up to this point in the chapter will assume that 
compactification to four dimensions has already taken place. However, there 
could be an era in the history of the universe during which all nine spatial 
dimensions are still of comparable size. The cosmology of this era and how 
it joins on to the era with just three large spatial dimensions will be discussed 
in section 9.5. [n particular, the question of why only three spatial dimensions 
become large will be addressed. 

All of this discussion is based on heterotic string theory. There are 
also promising candidate theories based on type IIA or type IIB string theory, 
containing extended solutions referred to as ‘D-branes’. The cosmology of D- 
branes will be discussed in section 9.6. 

Finally, in section 9.7 and section 9.8, we shall discuss two models for the 
universe which allow there to have been an evolution of the universe prior to the 
big bang. In the first of these models (pre-big-bang cosmology), the effect of 
the (weakly-coupled) heterotic-string dilaton on the cosmological field equations 
is exploited to obtain solutions with a growing positive Hubble parameter for 
t « 0 driving inflation before the big bang. In the second model (the ‘ekpyrotic’ 
universe), strongly coupled string theory is employed. Novel cosmology emerges 
from the existence of an 11th dimension in the dual M-theory which will be 
discussed in section 9.8. 


9.2 Dilaton and moduli cosmology 


Before discussing the cosmological implications of the existence of the dilaton 
and moduli fields and their supersymmetric partners, we give some arguments 
that allow the masses of these fields to be estimated [1]. The supergravity effective 
potential is given by (2.147). It is convenient here to rewrite this in terms of the 
F-term field 
Fi = ef’? (g^! y gi (9.1) 
and its adjoint 
F; = (F* = e8? (g- lg, (9.2) 


in the notation of (2.148). Then 
V = FFIGi - 3e. (9.3) 


The mass of the dilaton (or modulus) field, denoted by ¢ for the moment, is found 
by differentiating the relevant part of (9.3), namely 


V = FoF #GS —368 +--+. (9.4) 
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This means that we need an estimate of F at the absolute minimum of the 
effective potential to estimate these masses. 

First, because the dilaton and moduli fields have only gravitational strength 
interactions, for any superpositions of the fields GÌ ~ 1 in reduced Planck- 
scale units (Mp = 1). Because V = O at the absolute minimum (for zero 
cosmological constant), the square of the F-field responsible for supersymmetry 
breaking, assumed to be a superposition of the dilaton and moduli F-fields, is of 
order ef. Moreover, the gravitino mass is given by 


where Go is the value of G at the absolute minimum. Thus, 
F? ~ min (9.6) 


for the supersymmetry-breaking F -field superposition. If there is more than one 
superposition of the dilaton and moduli F-fields with a vacuum expectation value 
(VEV), we shall assume that (9.6) holds for each superposition or some are 
negligible. (With the usual definition of the scale of supersymmetry breaking 
Msusy, (9.6) is the statement 

m3. ~ m$usy (9.7) 


in reduced Planck-scale units.) 
Returning to the dilaton and moduli masses, 


av 
m = (—— (9.8) 
apadg* 
provided that the kinetic terms are minimal so that the $-field does not need 
rescaling. With V given by (9.4) and Fy of the order given by (9.6), m2 is a 


sum of terms of order eC? and terms of order Fy F® both of which are of order 
m2 /2" Thus, we might expect that 
mo ~ maia (9.9) 


where $ denotes a dilaton or modulus field. Similar but somewhat more 
complicated arguments can be made for the dilatino and modulinos. Detailed 
calculations confirm these expectations [1]. 

The cosmology of dilatinos and modulinos, which are light fermions with 
masses of order m3/2 with only gravitational strength interactions, resembles that 
of gravitinos. The dilatinos $ will have a decay rate 


T; miM (9.10) 


where m ; is the dilatino mass and a decay time 


-l1 
~r. (9.11) 
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To avoid “He and deuterium abundances being modified by the decay products 
of the dilatinos, we should insist on the temperature T; at the time of dilatino 
decay being larger than 1 MeV ~ 107?! M. If we assume a radiation-dominated 
universe during the period before the dilatons decay, then 


-1/2 13/2 ,,-1 
Tj, ~ m? Mp. (9.12) 

Thus, we require that 
mg > 107^ Mp ~ 10 TeV. (9.13) 


We expect the gravitino mass m3/2, which controls the sizes of soft 
supersymmetry breaking masses for the matter fields, to be not greater than about 
10 TeV to solve the hierarchy problem. This bound can just be satisfied, especially 
given that ms ~ m3,2 is only a rough estimate. The same discussion applies to 
the modulinos f. 

In any case, the cosmological problems that arise from the decays of light, 
very weakly-interacting fermionic fields can be avoided if there is a period of 
inflation, additional to the main period of inflation, to dilute this fermionic field 
density. However, the reheat temperature after this period of inflation should not 
be too high in order to avoid regeneration of the dilatino and modulino densities. 
Thus, as discussed in section 8.5 for gravitinos, we should have a reheating 
temperature 

Tg S 10? GeV (9.14) 


so that any necessary regeneration of the baryon number density should occur 
through low-temperature baryogenesis. 

The cosmology of the dilaton S$ and moduli 7; fields resembles that of the 
Polonyi scalar field discussed in section 8.6, and much of the calculation given 
there is unmodified. We shall focus on the dilaton field 5 but the discussion of the 
moduli fields 7; will be exactly similar. Consider first the case where the dilaton 
field has started to oscillate before the inflaton decays. With a dilaton field energy 
density at the end of inflation 


ps(tp) = 4m? s? (9.15) 


and assuming for the moment that the dilaton field does not decay, we find, as in 
(8.115), that the requirement to avoid the dilaton field energy density dominating 
the energy density of the universe today and producing too large an expansion rate 
is 

S « (1075 — 19716 54, (9.16) 


ms ^ m3/2 ~ 100 GeV-10 TeV ~ (10716-1071 Mp (9.17) 


we get 
$«107!!-107 2 (9.18) 
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in reduced Planck-scale units, which is much smaller than the value expected 
to be obtained when the effective potential for the dilaton is minimized in the 
presence of the inflaton field or when the VEV of the dilaton is shifted by quantum 
fluctuations during inflation. This value would instead be expected to be not many 
orders of magnitude less than unity, much as for the Polonyi field. (See (8.99).) 
Consequently, the dilaton field energy density would dominate the energy density 
of the universe today. 

In practice, S would be expected to decay but the previous discussion 
suggests the possibility of excessive entropy production when the decay occurs. 
The reheat temperature Ts g when the dilaton field energy density decays is 


3/2 


Tsp ~ m} (9.19) 


as a consequence of (7.72) with T's ~ m3. For successful nucleosynthesis to 
occur after reheating, we must have Tsg 2 1 MeV, which implies that 


ms > 107^ Mp = 10 TeV. (9.20) 


There is then an entropy increase 


T 3 
A= (=) (9.21) 
Tsp 
where Tsp is the temperature at which the dilaton decay occurs. In analogy with 
(8.122), 


Tsp ~ mi 68-205, (9.22) 
Thus, 
s2 
A~—. (9.23) 
ms 
For ms ~ 10 TeV, 
A ~ 10!45? (9.24) 


in reduced Planck-scale units. When S is not much smaller than 1 in these units, 
this is very large and may dilute the baryon number density of the universe 
unacceptably. Then the reheat temperature Tsg needs to be high enough for 
regeneration of the baryon number density to be possible. This imposes the bound 


ms 2, 1071-107" (9.25) 
as in (8.131). Thus, 
ms > 107-105 GeV (9.26) 


is required. This is not consistent with a dilaton mass of order m3/2. An exactly 
similar discussion applies for the moduli. 

As in the case of the Polonyi field, the problem is less severe in the case that 
the inflaton vacuum energy density does not decay until after the dilaton field has 
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already started to oscillate. If the dilaton field were not to decay then, following 
the discussion leading to (8.141), to avoid ps dominating the energy density of 
the universe and producing an excessive expansion rate, we need $ € 1075. This 
is not consistent with our expectation that S will not be very much less than unity 
in reduced Planck-scale units. Thus, ps will dominate the energy density of the 
universe. This suggests that there might be a problem with entropy generation in 
the realistic case where 5 decays before T = 2.7 K is reached. Then, following 
the logic leading to (8.154), the increase in the entropy of the universe in the 
reheating after the $ vacuum energy density decays is 


A~ mg asa, (9.27) 


As discussed in section 8.6, electroweak baryogenesis can occur at the reheating 
after the inflaton vacuum energy density decays when 


me 2, 107! Mp = 10’ GeV (9.28) 


and regeneration of the ^He and deuterium densities can occur following reheating 
after the dilaton vacuum energy density decays if 


ms > 10 TeV ~ 107^ Mp. (9.29) 


If, for example, we take mg = 107 GeV and ms = 10 TeV, then even for S ~ 1, 
we get 
A ^ 10 (9.30) 


which may not be so large as to be inconsistent with sufficient baryon number 
density surviving. A similar discussion applies in the case of moduli fields. 

As will be discussed next, even if too much entropy generation occurs when 
the dilaton or modulus vacuum energy density decays, it may be possible to 
regenerate the baryon number density (and “He and deuterium densities) in an 
extra stage of inflation referred to as ‘thermal’ inflation [2]. Such a period of 
inflation is produced by a scalar field æ with mass ma of order 100 GeV-1 TeV, an 
approximately flat potential, and a VEV (a) which is large on the 100 GeV-1 TeV 
scale. If this vacuum expectation value is too large, then such a field will produce 
a Polonyi problem of its own and so we require an expectation value which is large 
on the previous scale, but not too close to the Planck scale. So-called ‘thermal’ 
inflation takes place while the field o is trapped in the metastable minimum at the 
origin by thermal effects. For this trapping to be possible, the temperature should 
satisfy T > Ma. Otherwise the field would sit at the zero-temperature minimum 
away from the origin. For inflation to occur, the vacuum energy density should 
dominate over the radiation energy density and so we should have 


Vo > T^ (9.31) 


where Vo is the vacuum energy density of œ in the minimum at the origin. It is 
then possible for inflation to occur when 


ma £T S$ Vy". (9.32) 
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Because RT is constant during inflation, the number Ne of e-folds In(R¢/R;) is 


just In(7;/ Ty). Thus, 
yl/4 
Ne = In ( 9 ) . (9.33) 


Ma 


With mz ~ 100 GeV-1 TeV, the period of thermal inflation begins before the 
baryogenesis and nucleosynthesis that follow the earlier period of inflation. The 
thermal inflation may then sufficiently dilute the dilaton or modulus density to 
resolve problems with dilution of baryon number or “He or deuterium density 
caused by entropy generation when decay of the dilaton (or modulus) occurs. 
(Recall that the dilaton or modulus vacuum energy density is behaving like a gas 
of non-relativistic particles, as discussed before (7.64) in the case of the inflaton.) 


9.3 Stabilization of the dilaton 


As observed in section 9.1, the dilaton field has a flat effective potential before 
supersymmetry breaking. Let us assume that spontaneous symmetry breaking is 
due to non-perturbative ‘gaugino condensation’ in which a product of two gaugino 
fields develops a VEV. The effective potential of the dilaton including this effect 
may be calculated. It is then found that it is difficult for the dilaton to settle to a 
minimum of the effective potential. Before discussing this problem, it is necessary 
to review the form of the non-perturbative potential to be expected from gaugino 
condensation. 

The six unobserved spatial dimensions are usually compactified on an 
‘orbifold’ or a Calabi-Yau 3-fold which generally has three T-moduli determining 
the size of the compactified space in the three complex dimensions (as well as 
complex-structure moduli specifying its shape). For simplicity, assume that there 
is a single overall T-modulus field T (not to be confused with temperature). Thus, 
we take 


T=T,=h=T3. (9.34) 


If the various factors in the hidden-sector gauge group are labelled by the index a, 
the non-perturbative gaugino condensate superpotential Wpp is of the form (see, 
for example, [3] and references therein) 


Wap = D we (9.35) 


with 3 
Wa, = doe" S/an(T) 6, (9.36) 


In (9.36), n(T) is the Dedekind eta function, bg is the renormalization group 
coefficient for the ath factor of the hidden-sector gauge group including a 
contribution due to hidden-sector matter, d; is a numerical cofficient and so-called 
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‘Green-Schwarz terms’ have been omitted. The Kahler potential for the dilaton 
and moduli fields is 


K =In(S + $) 2 31In(T + T). (9.37) 

Then, the effective potential V derived from (2.132) is given by 

S R3 a 9 Wap |” 2 
(S - S)T -- TY V = |Wop — (S + ETE — 3|Wapl 
- 9 Wap | 
oT 

The non-perturbative superpotential should be chosen such that 

Re S~2 (9.39) 


at the global minimum of the potential, because of the connection between Re S 
and the value string of the gauge coupling constant at the string scale 


Re S= 2B cing’ (9.40) 


In particular, if the ath factor in the hidden-sector gauge group is SU(Na) and 
there are hidden-sector matter fields in Ma copies of Ng + Na fundamental 
representations, then 

ba = —Na + 4 Ma (9.41) 


and 


1 M, \Ma/(3Na—Ma) 
d, = (^. - Na) (327e)? Ma—Na)/BNa—Ma) (5) . (942) 


It is not possible to satisfy (9.39) with a single condensate but with two 
condensates minimization of the effective potential gives 
-~ N2M, — Ni M2 
Re Sx EET. — My - 3N, + Mi (9.43) 
which allows (9.39) to be satisfied for many choices of the integer parameters, 
together with yielding a realistic value of the gravitino mass m3/2 given by the 
value of e8/? at the minimum, as in section 9.5. There is also scope to tune the 
parameters to obtain V = 0 at this minimum and so zero cosmological constant. 
The dilaton stabilization problem [4] is a result of the peculiar shape of the 
potential V illustrated schematically in figure 9.1. If Re S starts larger than 2, 
there is only a very small region of Re § which allows it to roll to the desired 
minimum at Re $ = 2. If ReS starts smaller than 2, the very steep potential 
causes it to roll over the very low barrier, failing to be trapped at ReS = 2 
unless it starts very close to Re S = 2. Thus, trapping the dilaton in the desired 
minimum requires fine tuning of the initial conditions. (Strictly, we should correct 
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+ - 
2 ReS 


Figure 9.1. Representative two-condensate effective potential for ReS. The depths of the 
minima and height of the maximum are very small. 


the discussion to allow for Re S having non-minimal kinetic terms. This makes 
little difference.) 

However, the situation is drastically altered by several effects (5, 6] not 
included in the original treatment. First, the thermal energy density modifies 
the Hubble parameter and if the thermal energy density is large compared to the 
dilaton energy density, this can create sufficient ‘friction’ to allow the dilaton to 
roll into the Re $ = 2 minimum of V for a considerable range of initial values 
of Re S. Second, the dilaton couples to the energy density of matter and gauge 
fields and, third, Re S$ couples to the axion field, which is the imaginary part of S. 
We shall focus here on the first of these effects. 

For a homogeneous real scalar field @ with minimal kinetic terms and 
effective potential V ($), as in (7.33), 


$ó -3Hó t V'($) =0. (9.44) 


We now want to allow for contributions to the Hubble parameter from the thermal 
energy density of matter fields or radiation. We shall take the field ¢ to have a 
potential of the form 

V($) = Voe ?* (9.45) 


in reduced Planck-scale units and shall discuss later how this relates to the dilaton 
field. The pressure p. and energy density pẹ of the matter fields or radiation 
satisfy an equation of state 

Pe = (e - Dee (9.46) 


where € = 1 for a matter-dominated universe and € = 4/3 for a radiation- 
dominated universe during the rolling of $ towards the minimum of its potential. 


258 Superstring cosmology 


Then . 
H? = 4(V + $6? + pe) (9.47) 


is the generalization of (7.40) and (7.41), in reduced Planck-scale units. Using 
(9.47) and (9.44), 


: le. l, 
H = -3° + HPE (9.48) 


Also, from the energy-momentum conservation equation (1.45), 
Be = —3H (pe + pe) (9.49) 


so that . . 
= —5(G7 + pe + pe). (9.50) 


This ignores any interaction other than gravitational of $ with matter fields. These 
equations can be solved analytically for the asymptotic form of @ in the case of 
very steep potentials 

A? > 3e. (9.51) 


The field @ increases at first but the ‘friction’ corresponding to the Hubble 
constant then freezes ¢ at a near-constant value o for a time, where 


- J/6 l] 4- xo 
h = bo + egl (=) (9.52) 
and . 
do 
= 9.53 
xo JH (9.53) 


with do, $o and Ho denoting initial values. Finally, approaches the asymptotic 
form 


1 2A? Vo 3e 
= —In{ —~———_] + — In R 9.54 
v(t) n (sax) * n n R(t) (9.54) 


where R(t) is the scale factor (‘radius’) of the universe. The approach to 
the minimum is then slow and, after oscillations about the minimum with an 
exponentially damped amplitude, ¢ settles to its minimum. 

Let us now apply these considerations to Re S, taken for the moment to have 
minimal kinetic terms. In the region Re S < 2 but not too close to Re S = 2 
where the minimum has been produced by the balancing of two terms, the non- 
perturbative superpotential may be approximated by a single term Wap With S- 
dependence: 

Wa, ~ esas (9.55) 


where 
247? 


Aa = —— 
Na T 1M, 


(9.56) 
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is positive and has the smaller value. Then, the parameter A in (9.45) is given by 
A = 24a. (9.57) 


A more careful treatment [5] with the correct non-minimal kinetic terms for Re S 
makes very little difference to (9.54), with @ replaced by Re S. 

This discussion is valid provided that Re S is able to attain the asymptotic 
form of solution before passing through the minimum. Sufficient conditions [5] 
are to take the initial value of Re S, denoted by Re So, between the constant term 
in (9.54) and the value at the minimum Re Smin ~ 2, and also an initial velocity 
for Re S such that the constant value Re So, in the sense of (9.52), is also less than 
Re Smin. (In practice, the minimum we are trying to reach is at a smaller value of 
Re S than the adjacent maximum, and so we must start with Re S < Re Sp to have 
any chance of ending up in the minimum.) Thus, we require 


1 212, 
= In ( —77— | < Re So < Re Smin (9.58) 
A 9H3(2 — €)€ 


and an appropriate initial velocity for Re S. 


9.4 Dilaton or moduli as possible inflatons 


A priori, the dilaton or moduli fields (for an orbifold or Calabi-Yau 
compactification) are good candidates for inflaton fields (7] because their 
potential is completely flat to all orders in string perturbation theory. Non- 
perturbative effects, such as gaugino condensation, can provide a non-trivial 
effective potential. As we shall see shortly, if the dilaton or modulus field is 
to be used as the inflaton, it is necessary to assume that the superpotential is the 
sum of two components. (See, for example, [8).) One of these components has 
a large scale and gives an effective potential with unbroken supersymmetry and 
zero cosmological constant at the global minimum when the other component 
is neglected. This large-scale component is responsible for driving inflation 
when the dilaton or modulus expectation value is in a flat region away from the 
minimum. The other component has a much smaller scale and is responsible for 
supersymmetry breaking in the low-energy world. It is the former component 
of the superpotential that we are interested in here. Neglecting the low-energy 
component of the superpotential, it is convenient to write the effective potential 
for the dilaton S in the form 


V = g^ F(S, S) (9.59) 


where we are using reduced Planck-scale units, and F(S, S) is of order 1. To 
obtain density perturbations consistent with the COBE data, we require 


u ~ 1016-10! GeV (9.60) 
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as in (7.115). This contrasts with a scale of 
(m3/2Mp)'/? ~ 10!°-10!! GeV (9.61) 


for an effective potential due to the superpotential responsible for low-energy 
supersymmetry breaking with soft supersymmetry breaking masses on the scale 
of 102-10? GeV. This is the reason for assuming that the superpotential is the 
sum of two components. (It should be noted that the discussion of the previous 
section employed gaugino condensate superpotentials that had been designed to 
be responsible for low-energy supersymmetry breaking.) All of this discussion 
applies equally to the use of a modulus field as the inflaton. 

Though it is a very attractive idea, it has proved difficult to find a realization 
of it in practice. Multiple gaugino condensate potentials for the dilaton, such 
as discussed in the previous section, tend to be too steep in the Re S direction 
for inflation to occur. If, instead, the overall modulus field T is employed as 
the inflaton, with a superpotential consistent with the modular invariance of an 
orbifold compactification, up to 20 e-folds of inflation can be obtained. However, 
this does not appear to be possible when we demand that the effective potential has 
a minimum with unbroken supersymmetry and zero cosmological constant [9]. 


9.5 Ten-dimensional string cosmology 


Heterotic string theory begins as a ten-dimensional theory with the need for six 
dimensions to be compactified to provide us with the observed four-dimensional 
world (except in the case of direct constructions in four dimensions, such as the 
free-fermion construction). An attractive possibility is that the compactification 
of the six extra dimensions has a cosmological origin. In what follows we shall 
treat all spatial dimensions as being wrapped on a torus with all dimensions 
initially on the Planck scale. We shall study a mechanism, due to Brandenberger 
and Vafa [10], that naturally results in three of the spatial dimensions becoming 
very large, corresponding to a flat space, and the rest of the spatial dimensions 
remaining on the Planck scale. The dilaton plays a crucial part. 

As discussed in section 9.3, we expect the dilaton to acquire a mass of the 
order of the electroweak scale, or one or two orders of magnitude larger, when 
supersymmetry breaking occurs. For consistent cosmology, it is crucial that the 
dilaton does acquire a mass, because it is a scalar field with only gravitational 
strength interactions, and a massless field of this kind is inconsistent with solar 
system observations. However, there is no a priori objection to the dilaton having 
been massless in the early stages of the universe before supersymmetry breaking 
at a temperature of around 100 GeV. 

It will be assumed that the gravitational (metric) field and the dilaton field 
are slowly varying (adiabatic approximation) so that it is a good approximation to 
keep only the leading derivatives in the effective action. It will also be assumed 
that N spatial dimensions are large dimensions with time dependence, while the 
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remaining N. spatial dimensions are static compact dimensions. In the case of 
the heterotic string, 


N+N. = 9. (9.62) 


Eventually we want N = 3 but we shall leave N general for now. The effective 
action for the gravitational field and the dilaton is then (up to a multiplicative 
constant) 


So = J d"*!x Gje% (R + 4G^?84960p9) (9.63) 


where Gag is the metric tensor, G = det[G4g], R is the curvature scalar for 
the (N + i)-dimensional space with A = 0, 1,..., N and @ is the dilaton. In 
the case of N = 3, the connection with the value of the gauge coupling constant 
Sstring at the string scale iS gstring = e? and the connection with the dilaton field of 
section 9.3 is Re S = 2e~*#. (The antisymmetric tensor field in the supergravity 
multiplet is being ignored for simplicity.) All spatial dimensions will be taken 
toroidal with periodic length a;(t) fori = 1,2,..., N. Write 


aj(t) =e, (9.64) 
The metric is given by 
N H 
ds? = di? — Y ad (0) (x^. (9.65) 
i=] 


Keeping Goo general for the moment, and fixing it to 1 later so that we do not 
lose the field equation obtained by varying with respect to Goo, the curvature 
scalar (exercise 1) is 


N N 2 N N 
R- -oo| ya; +(x) +2304 - em yi. (9.66) 
i=l i=l i=l i=l 


It is convenient to define 


N 
&=29-) Ai (9.67) 
i=] 


which absorbs a factor for the volume of the space because 


e~% AIG] = /Gooe™®. (9.68) 


The action of (9.63) is then (exercise 2) 


N 
So = (fes) | dt Vane ton] F a _ e| (9.69) 


izl 
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In general, a thermal contribution Sr (from the gas of string modes in thermal 
equilibrium) should be added to the action. It has the form 


Sy = ( J a"s) f dt /GooF (ài, BV Goo) (9.70) 


where F is the free energy and 6 = T~! in units where the Boltzmann constant 
kg = 1. The complete action is 


Stor = So + Sr. (9.71) 


Varying this action with respect to ©, A; and G™ gives (exercise 3), after 
combining field equations, 


N 
(6)? - 3G =e%E (9.72) 
i=l 
Aj — dA; = le? P; (9.73) 
N 
$-'0) = fe%E. (9.74) 


i=l 


The thermodynamics that has been employed here is as follows. The free energy 
is 


F-E-TS (9.75) 


where E is the energy and S is the entropy with 


OF oF 
s=-(—) =f7(—}). . 
(57), - ^ (35), 079 
The pressure in the ith direction is 
oF 
P;=~-{—]. . 
(a ), o7» 


e=r-r(37) = F+ a) 9.78 
i aT), — "(5 v (3.78) 


E=F+2—— (9.79) 
00 
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where we have set Goo = 1 at the end. From (9.72)-(9.74), we may deduce 
(exercise 4) that 


N 
E+) AP; =0 (9.80) 


i=] 
Noor N 
le a . . . 
S=E-—- J à; — =E ) A; Pj. 9.81 
f i=l 9X t2 u een 


Combining these, we have . 
$20 (9.82) 


Thus, entropy is conserved. 

In the first instance, S is a function of f and the Aj. in principle, we can solve 
(9.82) to obtain f as a function of the A;. Then, E can be written as a function 
E(A) of the A; alone, where, for the moment, à denotes the A; collectively. When 


the entropy is constant, 
aE 
P=—-|—]. 9.83 
(5. ), 0.83) 


To solve the equations (9.72)-(9.74), we need some knowledge of E(A). For 
the moment we ignore the contribution of winding modes. The properties of 
E(A), now assumed to be a function of a single à when the A; have a common 
value, have been studied using the microcanonical ensemble [10]. It is T-duality 
symmetric, i.e. symmetric under the replacement of a; (t) by a; ! (t) so that 


E(a) = E(—2). (9.84) 


For A ~ 0, the so-called ‘Hagedorn region’, E(A) is almost constant. For 
sufficiently large A, only massless string modes contribute to the partition 
function, corresponding to a radiation-dominated universe. Then £(A) has the 
exponential behaviour 

E(a)~e™. (9.85) 


Between these two limiting cases there is incomplete knowledge of E(A). 
However, it is known that E decreases with |À] for à close to zero and this is 
believed to be correct for all A. 

This is enough information to see that a radiation-dominated era is 
approached as time increases if starts with a negative value. The argument 
is as follows. Because E is positive, (9.72) implies that È can never become 
zero, so that È can never change sign. Also, (9.74) implies that ® is positive. 
Consequently È increases and if it starts negative, it remains negative and 
approaches zero as £? — oo. Now consider equation (9.73). Using (9.83), and 
assuming that A; = A for all i, (9.73) is 


i — dA = —JePE"(A) (9.86) 
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(9.86) can be interpreted as the equation of motion for a particle with position A 
moving in a potential e? E(A) with a damping term, because È < 0. Since the 
potential decreases as |À] increases (ignoring the à dependence of ©), A slides 
towards increasing values of A. The radius of the toroidal space is a(t) = et) 
and, hence, as A increases a increases, when A > 0. (There is a duality between 
large and small values of a(t) and, hence, between positive and negative values 
of A, as noted in (9.84)). Thus, we need only discuss à > 0.) Since the spectrum 
of a string theory is known, the entropy can be calculated at a given temperature 
T in terms of the radius a(t). We know that entropy is constant. Therefore, a 
relationship between a(t) and T can be calculated numerically that ensures this. 
As |A| decreases, it is found that T increases towards a limit referred to as the 
‘Hagedorn temperature’. As |A] increases, T falls until the massive string modes 
go out of equilibrium and we enter a radiation-dominated era controlled by the 
massless string modes, as in the standard model of the universe. 

We now ask the question: “Why is the number N of large spatial dimensions 
equal to 3?’ A possible explanation turns on the presence of winding modes in 
string theories compactified on a torus. (Remember that we have taken all spatial 
dimensions to be toroidal.) Because of the periodic nature of a torus, the closed- 
string boundary conditions for the spatial bosonic degrees of freedom X*(r, o) 
can be satisfied when 


Xf (t,o +2) = X*(t,0) +20 L* (9.87) 
where centre-of-mass coordinates x* on the torus have the identification 
x* = xt + 2nL* (9.88) 


where L* are referred to as *winding numbers' and are proportional to the torus 
radius [11]. String modes with non-zero values of L* are referred to as winding 
modes. If p* is the centre-of-mass momentum of the string degree of freedom 
X*, the mass-squared of a string state includes (p* + 2L*)? and (p* — 2L*)?. As 
the radius of the torus increases, the squared mass of a winding mode increases. 

The idea is that string winding modes, unlike other matter densities, will 
oppose expansion of the dimensions of the universe. The reason for this is that, 
in the presence of winding modes, the behaviour of E(A) is very different from 
that discussed earlier. As just discussed, the mass squared of any winding mode 
increases as the square of the winding number, for large values of the torus radius, 
and so as the square of the torus radius. This effect results in E(A) increasing as 
e^. Roughly, the growth of E with A in (9.86) means that À « 0 (up to a damping 
term) so that à eventually becomes negative, A starts to decrease and the radius 
of the universe starts to decrease. In this way, the winding modes first stop the 
expansion of the universe and then reverse it. 

This argument is not quite correct because of the e? term in (9.86). As 
discussed earlier, if starts negative, it remains negative so that ® decreases 
with time. Thus, treating A as the position of a particle, the strengthening of the 
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potential with increasing A as the universe expands can be offset by the behaviour 
of e®. A more careful treatment shows that this does not affect the outcome. 

Once the universe starts to contract, the momentum modes, which have 
behaviour dual to the winding modes, play a crucial role. They make a 
contribution to E(A) that increases as A decreases and oppose the contraction. 
In this way, the universe is caused to oscillate between some minimum radius and 
some maximum radius within a few orders of magnitude of the Planck scale. 

The question then is how the universe can ever expand to a large scale. 
The answer is that string winding modes can annihilate totally or partially into 
momentum states (with complete annihilation of winding number occurring 
between winding modes with equal and opposite winding number). In this way 
they are able to reach thermal equilibrium with other string states. In thermal 
equilibrium the number of winding modes becomes small as the torus radius 
increases and winding-mode masses increase. However, it is difficult to reach 
equilibrium if the winding modes find it difficult to collide to annihilate. A 
collision corresponds to the two-dimensional world surfaces of the two states 
intersecting. Generically, this does not occur when the dimensionality N 4-1 of the 
extended spacetime in which the winding modes move is greater than 2 + 2 = 4. 
Thus, for N + 1 > 4, thermal equilibrium is not achieved and the winding modes 
stop the the universe expanding much beyond the Planck scale. However, for 
N +1 < 4, the winding modes annihilate readily and thermal equilibrium is 
reached resulting in a low density of winding modes. The universe is then able to 
expand to a large scale. 

This argument provides a partial explanation of the three-dimensional nature 
of our observed universe. If the universe starts to expand in some number N > 3 
of (spatial) dimensions, then the expansion is stopped by the winding modes. 
It then oscillates for a while before expanding again in some number N of 
dimensions that may differ from the first time. This may happen many times 
until finally the universe starts to expand in some number N of dimensions with 
N < 3. Then, the expansion continues. Of course, this only explains why N < 3 
and not why N = 3. Therafter, the discussion of the earlier part of this section, 
which neglected winding modes, applies and the universe evolves to a standard 
radiation-dominated universe. 


9.6 D-brane inflation 


The discussion so far in this chapter has been in the context of weakly coupled 
heterotic string theory. Alternative models of particle theory can be obtained from 
type II superstring theories because of the existence of extended so-called ‘Dp- 
brane’ solutions which occupy p+ 1 dimensions of spacetime. (See, for example, 
[12] and references therein.) As well as closed strings, the theory contains open 
strings which are constrained to have their endpoints on D p-branes. Chiral matter 
can be obtained from open strings whose endpoints are on D p-branes located at an 
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orbifold fixed point in the compact dimensions. We shall assume that this orbifold 
is a product of three two-dimensional tori with points on the tori identified under 
the action of a discrete Zy or Zm x Zyn group. 

Models with the gauge fields of the standard model (up to some U (1) factors) 
and the massless matter content of the standard model (up to some vector-like 
matter) have been obtained by employing D3-branes and D7-branes located at 
fixed points. The presence of D7-branes as well as D3-branes is necessary to 
satisfy certain ‘twisted tadpole’ cancellation conditions, which are required by 
a consistent string theory and, among other things, ensure non-Abelian gauge 
anomaly cancellation. The models also contain D3-antibranes and D7-antibranes 
which are needed to satisfy untwisted tadpole cancellation conditions. The chiral 
matter states are associated with open strings with their endpoints on D3-branes 
or with one endpoint on a D3-brane and the other on a D7-brane. In such theories, 
the string scale and the compactification scale do not necessarily coincide and low 
string scales are possible. 

An important aspect of such constructions is that the D p-branes and Dp- 
antibranes can be prevented from moving from the fixed points in some models 
by the requirement that the twisted-tadpole conditions are always satisfied. Then, 
the brane-antibrane separations are fixed except to the extent that they share in 
the contraction or expansion of a toroidal space when the radius of space varies. 
Thus, a modulus field in the associated low-energy supergravity theory whose 
expectation value is a brane-antibrane separation is not a candidate inflaton. 
However, possible candidates are the moduli scalars 7;, i = 1, 2, 3, the real parts 
of which are associated with the radii R; of the three tori in the form 


ti = Re T; = eê M? R? (9.89) 


where M; is the string scale and $ is the ten-dimensional dilaton. The four- 
dimensional dilaton $ is also a candidate. Models of inflation have been 
constructed [13] in which S or one of the 7; provides the inflaton while the other 
moduli (7; or S) are frozen by some unidentified mechanism. 

To discuss such models of inflation, we now require the form of the effective 
potential V as a function of the unfrozen modulus field. It is convenient to use 
T-duality with respect to all directions simultaneously: 


a’ 
Ri > — i=1,2,3 (9.90) 

Ri 
where 

a’ = M7? (9.91) 
to map D3-branes into D9-branes and D7-branes into D5-branes. We need the 
potential due to the tension in the branes. This is proportional to the volume of 
the branes and, for a theory of D9-branes and D5-branes, is of the form 


3 
V = NoVo + > Ns, Vs, (9.92) 


i=l 
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where No is the number of D9-branes, and Ns, is the number of D5;-branes, 
i.e. the number of D5-branes which wrap the ith torus (as well as living in four- 
dimensional spacetime). The potential Vg due to the D9-branes is of the form 


Vo = t9 R? RÈ R? (9.93) 
with the D9-brane tension 
To = agM!%e-¢ (9.94) 


where os is a dimensionless constant. Also, the potential due to the D5;-branes is 
of the form 
Vs, = 15R? (9.95) 


with the D5-brane tension 
ts = a5MSe? (9.96) 


and as a dimensionless constant. These potentials can be rewritten in terms of the 
real parts of the four-dimensional dilaton S and the 7; moduli fields: 


s = ReS = Me * R2R2R2 (9.97) 
ti = ReT; =M2e~*R? fori = 1, 2,3. (9.98) 
Then, 
3 
V =M? [Notos t Y Ns;ks, ‘| (9.99) 


i=l 
where kg and ks, are dimensionless constants of order !. There is also a 
contribution to the potential from the exchange of massless bulk states, such as 
the graviton, which we are neglecting here. It can be shown that this is small 
compared to the retained terms when the moduli are large. 

To apply this potential to the study of inflation, it is convenient to recast it 
in terms of fields with canonical kinetic terms. After Weyl rescaling to remove 
the factor of e^? in front of the curvature scalar (displayed in (9.63)), the kinetic 
terms for moduli are 


3 
1 
Lkinetic = -M2 leie” (a, Insdy Ins + Y In tj 8, In 2 (9.100) 


4 ‘ 
i=l 
the Weyl rescaling being 
Suv > Àguv (9.101) 
with 2.26 
M5e 
A= ILLI: (9.102) 
M5 Ri R3R3 


These kinetic terms arise from the curvature scalar 


R = G® Rap (9.103) 
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where a, b = 0, 1,...,9 and Gay is of the form 
Gab = diaglgyv(x), —R2(x)8mn, —R5(x)8rs, — R$ (x)9uv] (9.104) 


where x denotes the four-dimensional spacetime coordinates. In the first instance, 
the kinetic terms are then given in terms of Ri, R2 and Rs. After the Weyl 
rescaling (9.101), (9.102), these are recast in the form (9.100) in terms of s and 
ti. The potential is then rescaled by a factor A? to give 

koNg , ks Ns, + ks, Ns, + 2) 


vem, (2 


(9.105) 
fitot3 Stot3 52103 stati 


If, for example, we now freeze the moduli £1, t? and t3 and treat s as the 
inflaton, then the inflaton field X with canonical kinetic terms is given by 


$ = exp (=) . (9.106) 
P 


The potential for the inflaton is of the form 
V = o + kie 2XIMr (9.107) 
where xo and x; are constants 


ko = ko No M$ and a= ks, Ns, + ks, Ns, + ks, Ns, 


(9.108) 
tit2t3 tota 1h toti 


We can now study slow-roll inflation with this potential in the usual way. The 
slow-roll parameters € and n are defined in (7.184) and (7.185). In the present 
case, 2 

a e 2 2XIMp and nc 20 e^ V2XIMp (9.109) 
Ko KO 


ex 


when X >> Mp. Whenever X >> Mp, the parameters € and n are small and 
slow roll ocurs. Thus, slow roll is generic for large values of the modulus which 
is playing the role of the inflaton. This approximation also allows us to be in the 
low-energy field theory limit which requires weak coupling e? < 1. 

The next question to be addressed is when inflation ends. For x; < 0, the 
potential is such that X (which is certainly positive in this approximation) will 
decrease with time and, eventually, the slow-roll conditions will no longer be 
satisfied if |x;/xo| 2 1. Conversely, for xı > 0, X increases and there is no end 
to slow roll. However, as X grows, s grows, and either @ diminishes or one of 
the R; diminishes. If the latter, then at least one of the R; can become smaller 
than the string scale and low-energy field theory and the validity of the low- 
energy potential V break down. In this case, there is a striking mechanism which 
could end inflation. There can be a critical value of a radius at which a tachyon 
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appears in the spectrum. Then, inflation ends in a version of hybrid inflation. In 
some orbifold models a brane—antibrane pair then annihilates to produce a single 
brane with one dimension more than the original brane, wrapped around this extra 
dimension, which is stable at a smaller radius. The reheating is then controlled by 
the difference in tension between the brane—antibrane pair and the single brane. 

As to the number of e-folds of inflation, it does not appear possible [13] in the 
models described earlier to obtain a sufficient number of e-folds, except for large 
values of the Ns,. In that case, the approximation of retaining only the tension 
term in the potential breaks down and even including exchange of single bulk 
States (e.g. the graviton) in the brane-antibrane interaction may not be sufficient. 
However, this problem may be overcome [13] when the compactified dimensions 
form an orientifold rather than an orbifold. Then, there still remains the difficulty 
that we have had to freeze all but one of the 7; moduli or dilaton fields arbitrarily. 

An alternative type of brane model (see, for example, [14] and references 
therein) is based on intersecting D-branes with chiral matter living on (some of) 
the intersections, in the sense that it is associated with open strings that begin and 
end at a particular intersection of two D-branes. This type of model also provides 
a satisfactory model of inflation [15] up to a point but with the same difficulty of 
having to freeze all but one of the moduli discussed earlier. 


9.7 Pre-big-bang cosmology 


The presence of the dilaton in the heterotic string theory action allows for a 
possible alternative origin for inflation in a period of evolution of the universe 
before the big bang [16]. The basic idea is that it may be possible to join together 
two solutions of the cosmological field equations, one for £ < O0 and one for 
t > 0, with the following properties. The t < 0 solution is chosen to have the 
dilaton $ growing to produce a growing Hubble parameter H , so that the universe 
expands rapidly (dilaton-driven inflation). This is an even more rapid expansion 
than the more familiar inflation driven by an approximately constant H. The 
t > 0 solution, conversely, has || decreasing and ¢ rolling into a minimum of its 
potential. Thereafter, there is FRW cosmology (with a constant dilaton), possibly 
higher-dimensional, with compactification to three spatial dimensions to follow. 

As t = 0 is approached from t < 0, we shall see that it is possible for the 
value of ¢ to be positive and diverge logarithmically, which pushes the universe 
into a strongly-coupled regime because e?? controls the strength of the gauge 
and gravitational interactions in heterotic string theory. At some point, weakly- 
coupled string theory breaks down and non-perturbative effects may allow the 
transition between the t < 0 and t > 0 weakly-coupled solutions. We shall now 
fill in a little of the detail of this idea. 

The effective action for the gravitational field and dilaton is as in (9.63). 
We shall leave the number of spatial dimensions N arbitrary, allowing for the 
possibility that the cosmology starts higher-dimensional with compactification to 
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three dimensions to follow, e.g. cosmology could start with the full nine spatial 
dimensions of the heterotic string. With the ansatz of (9.65) and (9.64), the action 
is given by (9.69) leading to the field equations (9.72) to (9.74). Making the 
simple assumption that the universe starts empty, the thermal contributions may 
be dropped and the field equations simplify to 


N 

(6)? = Y'a; (9.110) 
i=l 

ài = 01; (9.111) 
N 

b= Yo (9.112) 


izl 
where A; is defined in (9.64) and ® in (9.67). 
For t Æ 0, there are solutions of (9.110)-(9.112) with (exercise 5) 


P(t) = do — In jt] and A(t) = Ao + a In |t] (9.113) 


in the isotropic case A; = A for all i. Equivalently, there are solutions 
2(t) = 269 +(£VN —1)Injt| and alt) = aott (9.114) 
in the isotropic case a; — a for all i. The corresponding Hubble parameter is 


1 
/Nt 


which diverges as t — 0. The two solutions for A arise because of the T-duality 
symmetry of the equations (9.110)-(9.112) under the transformations 


Hot ck (9.115) 


Àj > —Ài $=- (9.116) 
or, equivalently, 
N 
l 
a> g $—$6-) Inai (9.117) 


i=1 
which exchanges small and large scales accompanied by an obligatory action on 
the dilaton. The equations and their solutions also possess the usual t — —t 
symmetry of FRW cosmology. Because of the singularity at t = 0, in principle, 
we may pair either of the solutions fort < 0 with either of the solutions for t > 0. 
The hope is that non-perturbative effects will allow a smooth matching at t = 0. 
However, after the big bang, we require that the universe is expanding so, for 
positive t, we must choose the solution with a(t) «x TRA VN Now consider the 
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Figure 9.2. Hubble parameter for a possible solution linking the pre-big-bang era to the 
post-big-bang era. 


behaviour of the solution obtained by pairing this positive-time solution with the 
negative-time solution in which the universe is also expanding. That is 


Los — |WN- Dint t>0 
20(1) ~ 240 | —(V/N +1)In(-t) 1 <0 
(UN t>0 


409 oE 120 


(9.118) 


and 


H(t) — (9.119) 


l 
VNIrI 
As advertised earlier, (for N > 1) this gives $ (t) growing for all t 4 0, whereas 
the Hubble parameter H, which is positive on both branches of the solution, is 
growing for t < 0 but decreasing for t  O (see figure 9.2): $ (t) is discontinuous 
and diverges logarithmically as £ — 0. This ‘pre-big-bang’ cosmology provides 
an alternative to inflation due to a scalar field roiling in a potential. There is no 
potential for the dilaton ¢ but nevertheless the universe inflates when t < 0. 

The biggest difficulty, perhaps, is that it is not known whether the pre-big- 
bang and post-big-bang eras can be joined together smoothly (a form of the 
graceful exit problem) because the region close to £ = 0 requires non-perturbative 
string theory. It has also been suggested that fine tuning [17] is involved in a 
successful pre-big-bang scenario. One aspect of this problem is that as a result of 
(9.118), when a(t) increases by many orders of magnitude while t < 0 to solve 
the problems normally solved by inflation, e?(' also increases by many orders of 
magnitude. However, as observed after (9.63), e? is swing. This we expect to 
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be of order unity in the post-big-bang universe and so the initial value of $ must 
be tuned to be very small. Moreover, the density perturbations due to the dilaton 
(playing the role of the inflaton here) are not consistent with the COBE data. 
However, it has been argued that this last problem can be overcome if the density 
perturbations required by COBE are due to an axion field in a non-perturbative 
potential. (See the third reference in [16].) 


9.8 M-theory cosmology—the ekpyrotic universe 


Another example of a cosmology in which there is a period of evolution of the 
universe prior to the big bang can be obtained by considering the collision of the 
so-called ‘boundary branes’ which occur in M-theory. We first review briefly the 
M-theory description of strongly-coupled heterotic string theory (18]. 

Strongly-coupled heterotic string theory in ten dimensions is known to be 
dual to a theory in 11 dimensions. If the strongly-coupled heterotic string theory 
has six dimensions compacified on a Calabi-Yau manifold or orbifold X, then the 
dual theory has seven dimensions compactified on X x 5!/Z». Here S! is in the 
X’ direction with 


-rp < X < np (9.120) 
and the identification 

X! ~ x0 125. (9.121) 
The action of the Z2 group is 

Z2: X! > —x!?, (9.122) 


Consequently, there are two fixed points under the action of Z2, namely 
X?-0 and =X =a. (9.123) 


Because of the Z2 symmetry (9.122), the quotiented circle is equivalent to a line 
segment of length zp. 

The low-energy limit of this 11-dimensional theory is 11-dimensional 
supergravity. The Eg gauge fields of the observable sector (and the observable 
chiral matter) live on one end of the line segment (on one four-dimensional 
boundary brane) while the Eg gauge fields of the hidden sector live on the other 
end of the line segment (on the other boundary brane). The gravitational fields 
propagate in the 11-dimensional ‘bulk’. The dilaton expectation value e? in the 
ten-dimensional theory is reinterpreted, up to a numerical factor, as the length of 
the line segment in the 1 1-dimensional theory in M; ! units. 

The idea behind M-theory cosmology [19] (the so-called ‘ekpyrotic 
universe—the universe being consumed by fire and reconstituted out of fire, 
as in Stoic philosophy) is that two four-dimensional boundary branes may 
move towards each other and collide before separating again. The size of the 
fifth dimension (which is the separation of the two four-dimensional boundary 
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branes) goes through zero when the branes collide. Since the separation of the 
boundary branes is related to the dilaton in the dual description, the dilaton 
dynamics discussed in the previous section may be employed taking (the number 
of uncompactified spatial dimensions to be) N = 4 (and ignoring the six 
compactified dimensions.) As before, we require the universe to be expanding 
after the big bang but now we choose the alternative solution in which the universe 
is contracting before the big bang. Thus, the solution corresponds to the choice 
A= +(1//N) In [t| on both branches. For N = 4, this gives 


26(0 —24o — Inl]. — a(D iti? so that H = = (9.124) 


This choice is qualitatively different from the one made in pre-big-bang 
cosmology. There, before the big bang, $ (t) is positive and diverges as t — 0, so 
that the gauge and gravitational interactions, whose strength is controlled by e”4, 
become strong and the vicinity of t = 0 is in a strongly-coupled regime. Here, 
$ (t) is negative near ¢ = 0 and diverges as t — 0. Instead of strong coupling, 
the vicinity of t = O is in a weakly-coupled regime. When the branes collide, 
radiation modes are excited by the kinetic energy of the collision and a hot big 
bang is triggered. 

The ekpyrotic universe allows a new solution to the horizon problem. The 
collision of the two boundary branes is a non-local event over a region much larger 
than the Hubble radius. It is this collision that generates the temperature of the hot 
big-bang universe. Thus, a large degree of homogeneity in the cosmic microwave 
background radiation is to be expected. However, a number of difficulties in 
trying to get the ekpyrotic universe to explain things normally explained by 
inflation has been pointed out. (See [20] and references therein.) 


9.9 Exercises 


Derive the curvature scalar of (9.66). 

Derive the action (9.69). 

Obtain the field equations (9.72)-(9.44) from the action (9.69). 

Derive the finite-temperature equations (9.80) and (9.81). 

Derive the solutions of the cosmological field equations for pre-big-bang 
cosmology (9.113). 
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chapter are: 
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Chapter 10 


Black holes in string theory 


10.1 Introduction 


Besides ‘predicting’ general relativity as the effective low-energy (classical) 
theory of gravitation, string theory also provides a (perturbative) quantum theory 
of gravity. Since it is the only such theory known, one might hope that string 
theory would offer insights into the quantum aspects of gravitation that are not 
available elsewhere. This is indeed the case. 

In 1971, Hawking [1] showed that area of the event horizon of a black 
hole must increase with time. This prompted the observation that the area 
of the event horizon is analogous to the entropy of a thermodynamic system. 
It was subsequently shown that quantum mechanics requires that black holes 
besides absorbing radiation must also emit it and that black holes are indeed 
thermodynamic systems. Specifically, it was shown by Bekenstein [2] and 
Hawking [3] that the entropy is proportional to the area of the event horizon. 
Our experience with statistical mechanics as the microscopic theory underlying 
thermodynamics leads us to expect that this entropy is associated with the 
number of microstates of the (black-hole) system. It is precisely this aspect 
that is illuminated by string theory. As we shall see, for certain black-hole 
solutions of string theory, the number of microstates can be calculated, and the 
resulting multiplicity reproduces precisely the Bekenstein-Hawking formula for 
the entropy. 

In the next section, we review the definition of the black-hole event horizon 
and outline the proof that the area of its two-dimensional section cannot decrease. 
In section 10.3, we show why quantum mechanics requires black holes to have a 
temperature that is determined by their ‘surface gravity’ and entropy proportional 
to the area of the (two-dimensional section of the) event horizon. The number of 
perturbative microstates in string theory is evaluated in section 10.4 and shown to 
be quite inadequate to explain the derived entropy of black holes. The special 
class of black holes for which string theory is able to provide a microscopic 
explanation of their entropy are (certain) ‘extreme’ black holes, which have both 
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mass and charge (or, more generally, charges). These and their generalization 
to five dimensions are discussed in the following section. To get black holes 
from string theory, we need to find analogous solutions to the underlying classical 
field theory. This is type II supergravity, which is described in section 10.6. It 
involves the field strengths of certain (antisymmetric) ‘Ramond—Ramond’ gauge 
form-fields. These lead to a generalized notion of electric charge, which, in 
turn, indicates the existence of (non-perturbative) extended objects called 'D- 
branes’ which have non-zero Ramond-Ramond charges. This is described in 
section 10.7. In section 10.8 we construct the (five-dimensional, extreme) black- 
hole solutions, with three charges, that have an event horizon with non-zero area. 
If we use this area in the Bekenstein-Hawking formula, the entropy of the black 
hole is determined entirely by the (product of the) charges used. The non-zero 
charges have an immediate interpretation in terms of underlying microstates and 
the counting of these is done in section 10.9. The number of microstates obtained 
agrees precisely with that predicted from the calculated Bekenstein-Hawking 
entropy. 


10.2 Black-hole event horizons 


It is convenient to use mass units in which the Planck mass mp and, hence, 
Newton's constant Gy, are unity: mp = Gi" 2 1. The most well-known 
black-hole solution of general relativity is the Schwarzchild solution which gives 
in spherical polar coordinates the line element outside of a spherical body of mass 
M: 


ds? = guy dx" dx" (10.1) 


M 2M\"! 
= ( - =) dr? — ( - :3 dr? - r? 403 (10.2) 


where 
dQ3 = d6? + sin? 6 dg? (10.3) 


is the line element on the unit two-sphere 57. The metric is singular at r = 2M 
but this is merely a coordinate singularity. For example, a particle on a radial 
timelike geodesic r = R(t) falls from its starting position at r = R(0) > 2M, 
through R = 2M, and reaches R = O in a finite proper time (exercise 1). On a 
radial null geodesic, 


2M? 
dr? = (: - =A dr? = (dr*)? (10.4) 
where 
r =r +n 2M (10.5) 
= 770 . 
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As r ranges from 2M to oo, r* ranges from —oo to oo. Thus, d(t + r*) = 0 
on radial null geodesics and the ingoing radial null (Eddington—Finkelstein) 
coordinate is defined as 


r 
= *— — — — 
vettr thr + 2M In| 1| œ < v < œ. (10.6) 


Using v as a coordinate instead of t gives 
2 2M\ 2 2 492 — r2<in29 da2 
ds* = I=- dv^ — 2dudr — r^ d0^ — r“ sin“ 0 dọ<^. (10.7) 


This metric is defined initially for r > 2M, since the relation v = r + r*(r) is 
only defined for r > 2M. However, it can now be analytically continued to all 
r > 0 and, in these coordinates, there is no singularity at r = 2M. (There is 
a singularity at r — O where the curvature becomes infinite. Such a singularity 
cannot, of course, be removed by a coordinate transformation.) At large values of 
r, the light cones are almost Minkowskian and they allow a particle (or photon) 
to move outwards or inwards on a timelike (or null) worldline. However, as r 
decreases the lightcones gradually tilt over. When r 2M, 2dr dv < 0 for all 
non-spacelike (i.e. timelike or null) world lines. Since dv > 0 for future directed 
world lines, it follows that dr < 0, with equality for radial null geodesics when 
r — 2M. Whenr « 2M, all non-spacelike curves necessarily move inwards 
and hit the singularity at r = 0. Thus, if the massive body emits light from its 
(spherical) surface at r = rg « 2M, the light never escapes to an observer in 
the region r > 2M. Such an observer might infer the presence of the body from 
its gravitational field but she could not see it. The hypersurface traced out in 
spacetime by the spherical surface r — 2M is called the 'event horizon' of the 
(Schwarzchild) black hole. The area of a two-dimensional section of the event 
horizon is 

Au = An (2M)! = 16x M*. (10.8) 


Let S(x) be a smooth function of the spacetime coordinates x^ and 
consider a family of hypersurfaces S(x) = constant. The vectors normal to the 
hypersurfaces are given by 


- as 
ly = — 10.9 
p= f) axe (10.9) 
where f is an arbitrary non-zero function. If /? = 0 fora particular hypersurface 


N in the family, then N is said to be a ‘null’ hypersurface. So for the spherical 
hypersurfaces § = r = constant, with the black-hole metric (10.7), 


P= gly =P=- (i - at) F. (10.10) 


Thus, the event horizon r — 2M is a null hypersurface and (exercise 2) 


I^|, 24 = g""ly],-au = — fat. (10.11) 
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We can think of the metric (10.2) or (10.7) as being that which arises 
following the spherically symmetric collapse of a star with mass M 2 1.5- 
2mo. It is instructive to consider a series of light flashes emitted near the centre 
of the collapsing star, which is assumed to be made of transparent matter. In 
the early stages, the density of the star is low, the wavefront of the light will be 
approximately spherical and its area proportional to the square of the time elapsed 
since the emission of the flash. However, the gravitational attraction of the stellar 
matter through which the light is passing deflects neighbouring rays towards each 
other, reducing the rate at which they are diverging from each other. In other 
words, the gravitational effect of the matter is to focus the light and to reduce the 
area of the wavefront from what it otherwise would have been. In the early stages, 
the wavefront continues to increase in area, crossing the surface of the collapsing 
star and eventually reaching infinity. As the collapse continues, the matter density 
increases and so does the focusing effect, until a critical wavefront emerges from 
the surface of the star with zero divergence. Outside of the star, this wavefront 
will remain constant and will be the surface r — 2M discussed earlier whose 
spacetime evolution is the event horizon. It is the boundary of the spacetime 
region from which it is not possible to escape to infinity. It is generated by null 
geodesics which have no future endpoint but which do have past endpoints (at the 
emission of the flash.) The divergence of these null geodesic generators is positive 
during the collapse phase and zero in the final time-independent state. The area 
of a two-dimensional section of the event horizon increases monotonically from 
zero to the final value (10.8). Subsequent flashes will be focused so much by the 
stronger gravitational focusing that their rays begin to converge and the area of 
the wavefront decreases. 

Now consider what happens when a thin spherical shell of matter of mass M 
collapses from infinity at some later time and hits the singularity at r = 0. During 
the collapse, the metric is spherically symmetric but, of course, time-dependent. 
Afterwards, it will have the form (10.2) or (10.7) but with M replaced by M --5M. 
Since 5M is necessarily positive, the area of the two-dimensional section of the 
event horizon must increase: 


6An = 321 Mó6M > 0. (10.12) 


These results illustrate general results for black holes that are true even 
without spherical symmetry. The focusing or converging effect that follows 
from the fact that the gravitational mass is always positive can be described 
quantitatively using the positive-definiteness of the energy density. Consider a set 
of null geodesics, and let /^ = dx” /dv be a null tangent vector to these geodesics, 
where v is an affine parameter for the geodesic. At each point, we can define two 
unit spacelike vectors a^ and b” that are orthogonal to each other and to /#. It is 
convenient to define the complex vectors 


= ato" + ib") and m^" = g” — ib"). (10.13) 


Black-hole event horizons 279 
Then 
m'm, — 0 = m"m, — ^m, — I"m, and m"m, =—1. (10.14) 
The fact that the curves of this set are geodesics requires that 
laum” =0 (10.15) 


where, as usual, the semi-colon indicates covariant differentiation. The average 
rate of convergence of nearby null geodesics is encoded by the quantity 


p = dy,,m"m" (10.16) 


which is real provided that the null geodesics lie in a three-dimensional null 
hypersurface, as we shall assume. Let M be the null hypersurface generated 
by null geodesics with tangent vectors /^ and let AT be a small element of 
a spacelike two-dimensional surface in M. We can move each point of AT a 
parameter distance dv up the null geodesics. Then the area A of AT changes by 


8A = —2Apóv. (10.17) 


Thus, as we should expect, the area decreases if the convergence p 1s positive. 
The behaviour of p along the geodesics is determined from the Newman-Penrose 
equations [4] which for an affine parametrization (so /"L,,;, = 0) give 


TP L p? +08 + doo (10.18) 
where 
c zl,m"m" and — ġo = 3R,"I". (10.19) 


The Einstein field equations are 
Ry» — 3g, R = 82 Tav (10.20) 
where 7,,, is the energy-momentum tensor. Thus, 
goo = 41 T,,I"l". (10.21) 


The local energy density measured by an observer with velocity vector v^ is 
T, , v" v" and it is reasonable to assume that this is always non-negative. Then, 
from continuity, the ‘weak energy condition’ 


T,,w^"w" > 0 (10.22) 


follows for any null vector w^. With this assumption, (10.21) shows that $oo > 0 
and then, from (10.18), that the effect of the matter is always to increase the 
average convergence, i.e. to focus the null geodesics. 


280 Black holes in string theory 


For a general discussion of what can be seen from infinity and, therefore, of 
the event horizon, we need to determine the light cone structure of spacetime. For 
this purpose, it is useful to do a conformal transformation of the metric 


Suv > guv- (10.23) 


Such a transformation leaves the light-cone structure unaffected but can be chosen 
so as to compress everything near infinity and bring it to a finite distance. For 
Minkowski space, the line element is 


ds? = dt? — dr? — r? ac. (10.24) 
In light-cone coordinates u = t — r and v = t + r, this becomes 
ds? = du dv — 1(v — u}? ag2. (10.25) 


Now define new coordinates p and q such that tan p = v and tanq = u with 
p—qą > 0. Then 


ds? = sec? p sec? q[dpdq — } sin?(p — q) d23] (10.26) 


which shows that the Minkowski metric is conformally related to the metric 
whose line element d5? is in the square brackets. With a further coordinate 
transformation p = t' +r’, q =t — r! the line element becomes 


ds? = dt? — dr? — 1 sin? r^ Q3. (10.27) 


Thus, Minkowski space is conformal to the region bounded by the null surfaces 
Tt = (t +r’ = 2/2} and Z- = {t — r' = —7/2). It is the past light cone 
of the point it = (r' = 0,1! = 7/2) and Z- is the future light cone of the 
point i7 = (r' = 0, r’ = —7/2). All timelike geodesics start at i ^ , representing 
past timelike infinity, and end at i+, representing future timelike infinity. Null 
geodesics start at some point on the surface Z^ and end at some point on Z+, 
We are interested in (black-hole) spacetimes that are asymptotically flat. This 
means they must be ‘like’ Minkowski space near infinity, and so should have a 
similar conformal structure at infinity. In fact, the conformal metric is, in general, 
singular at the points i+ and i^ but regular on the null surfaces Z* and Z7. 

Consider the set J^ (S) consisting of a set S of spacetime points plus all 
points from which S can be reached by future-directed non-spacelike curves. The 
region of spacetime from which one can escape to infinity along a future directed 
non-spacelike curve is, therefore, J~ (I+), the causal past of future null infinity. 
The boundary of this region J~ (T+) is the general definition of the event horizon. 
It is generated by null geodesics segments which may have past endpoints but can 
have no future endpoints. Now, using the positivity of $0o, it follows from (10.18) 
that 


(10.28) 
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Suppose that the convergence p of neighbouring generators has a positive value 
po > 0 at some point q on a generator of J- (Z^), then p increases to infinity 
within a finite affine distance Av < 1/po to the future of g. The point r at 
which p becomes infinite is a focal point at which neighbouring null geodesics 
intersect. In other words, if the generators of J~(Z+) ever start converging, 
they are destined to have future endpoints within a finite affine distance. This 
contradicts the previously stated property that such generators have no future 
endpoints. It follows [1] that ọ < O0 everywhere on the event horizon and, 
therefore, from (10.17), that the area of the two-dimensional cross section cannot 
decrease with time!. As already noted, this prompted the observation that this 
area is analogous to the entropy of a thermodynamic system. 


10.3 Entropy of black holes 


In fact, the area of the two-dimensional section of the event horizon will remain 
constant only if the black hole is in a stationary state. If the black hole interacts 
with anything else the area always increases. In this respect, the area behaves 
similarly to the entropy of a thermodynamic system. In favourable circumstances, 
one can arrange that the increase in area can be made arbitrarily small, which 
corresponds to nearly reversible transformations in thermodynamics. 

During black-hole formation in the collapse of a star, the metric is strongly 
time-dependent and a complete classification of all solutions has not been 
found. However, the possible final stationary states have been identified. An 
asymptotically flat metric is called stationary if there exists a Killing vector? k 
that is timelike near infinity (where it may be normalized such that k? = 1). In 
other words, outside of the horizon k — 3. where t is a time coordinate. In these 
coordinates, the general stationary metric has a line element of the form 


ds? = goo(x) dt? + 2go; (x) dt dx! + gij (x) dx! dx. (10.29) 


A stationary metric is called static if it is also invariant under time-reversal, at 
least near infinity. Thus, the general static metric has go; = O and the line element 
takes the form 


ds? = goo(x) dt? + gij (x) dx! d. (10.30) 


! It would be possible to escape this conclusion if the generators were prevented from reaching the 
finite affine distance to the endpoint because of an intervening singularity. However, Hawking [5] has 
shown, using the general requirements of asymptotic predictability, that this does not occur. 

? A general coordinate transformation x — x’ is called an ‘isometry’ if the transformed metric 
gv) is the same function of its argument x"* as the original metric gy (x) was of its argument x^. 
The generators of such transformations may be found by considering an infinitesimal transformation 
in which x^ = x^ + e£ with € « 1. This is an isometry if & satisfies Suv + $v:u = 0, and 
any vector satisfying this is called a ‘Killing’ vector. We may equivalently write the Killing vector as 


E = gt ad, 
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With modest assumptions about causality, it can be shown that the only static 
(single) black-hole solution of the Einstein field equations is the (spherically 
symmetric) Schwarzchild solution given in (10.2). 

This result can be generalized to black holes with electric charge Q by 
solving the Einstein-Max well equations. They are derived from the action 


SEM f dx (-g) PIR — Gy Fuy FY"). (10.31) 


1 
~ 160Gn 
where F,,, is the electromagnetic field strength. Then the Einstein field equations 
(10.20) have the energy-momentum tensor 


1 l 3 
Tav = Am (rr. -— 5 8uvFysF” ) . (10.32) 
The only static solution is the Reissner-Nordstróm (RN) solution with line 


element 
2M Q? 2M QN. 
ds? = (1-5 + = J] de? -[1-—— +) 4üj-naej (10.33) 
and gauge potential 1-form 


A= A, dx" = 2a. (10.34) 


Like the Schwarzchild metric, this has a curvature singularity atr = 0. For 
M > |Ql. there are coordinate (but not curvature) singularities at r = r+ = M+ 
y M? — Q?. This assumes that M > |Q], since otherwise there are no horizons 
and the curvature singularity at r = 0 is ‘naked’. More generally, these results 
can be extended to stationary black-hole solutions. The stationary solutions of the 
Einstein equations are the (axially-symmetric) Kerr solutions, classified by two 
parameters, the mass M and the angular momentum J. Generalizing to solutions 
of the Einstein-Max well equations leads to the three-parameter Kerr-Newman 
metrics: 


A — a? sin? 24 g2_A 
ds? = SESE de? + 2a sin? + ^ ar ag 
(r? + a2)? — Aa? sin? 0 


X 
= sin? 0 dg? — A dr? — x d6? (10.35) 


where 
E-r?)-xalcoj0 and  Azr?-2Mr 4a? +Q. (10.36) 


The three parameters are M, a and Q. a is related to the total angular momentum 
J by 


a=—. (10.37) 
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If we allow magnetic charge P as well as the electric charge Q, we replace Q by 


e= JQ? P. (10.38) 


The Maxwell 1-form is 
cos 0 


L ar — asin? 6 d$) — Da dt — (r? + a°) dd). (10.39) 


The future and past event horizons are at 


A= 


r =r} = M (M? -— Q?-a?)'? (10.40) 
so that the future event horizon has area Ay given by 
Ay = An(2M? — Q? + 2(M4 — M? g? — J?)!/2), (10.41) 


This can be rewritten as 
T. (10.42) 


The first term on the right-hand side is the ‘irreducible’ part of M? that is 
irretrievably lost down the black hole. The second term is the contribution from 
the rotational energy of the black hole and the third and fourth terms arise from 
the electrostatic energy. The mass M, as opposed to M?, can also be written in an 
elegant form due to Smarr [6]: 


M =H 42047 + On0 (10.43) 
where, on the future event horizon, the surface gravity (i.e. the acceleration of 
a static particle as measured at spatial infinity) is x, the angular velocity is 24 
and the co-rotating electrostatic potential is ® y; all of these are constant on the 
horizon. If such a black hole is perturbed and settles down to another stationary 
black hole with parameters M 4- dM, J +dJ and Q + dQ, then (exercise 4) 


dM — a dAn +94 dJ + by dQ. (10.44) 
Comparing this with the thermodynamical (first law) formula 
dU =TdS + pdV +pdN (10.45) 


we see that if some multiple of the area Ay of a section of the event horizon is 
analogous to entropy, then some multiple of the surface gravity « on the horizon is 
analogous to the temperature. Bekenstein [7] suggested that these are not merely 
analogues but, in some sense, actually are the entropy and temperature of the 
black hole. 
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However, if the black hole has a temperature, it should radiate a black- 
body spectrum, thereby contradicting the defining property of a black hole that 
it can absorb particles or radiation but not emit them. Hawking’s resolution of 
this paradox noted that this absorption-only property is a feature of the classical 
theory of gravitation. He showed that quantum mechanical effects cause black 
holes to create and emit particles as if they were hot bodies with a temperature 
Ton given by 

Toh = —. (10.46) 
27 


Then, from (10.44) the (Bekenstein-Hawking formula for the) entropy Spn is 


Son = JAn. (10.47) 


The result is obtained by treating the spacetime metric classically but the 
matter fields to which it is coupled are treated quantum mechanically. We should 
expect this semi-classical approximation to be excellent except near a spacetime 
singularity. In flat Minkowski spacetime, a massless real scalar field $, for 
example, satisfies the field equation n^". „v = 0 and ¢ can be expanded in terms 


of annihilation and creation operators d; and âl as 


6-Y à f «alf (10.48) 


where the ( f;) are a complete orthonormal set of positive-frequency (complex) 
solutions of the wave equation 5^" fi;,, = 0: the positive frequency is defined 
with respect to the usual Minkowski time coordinate. The vacuum |0) is then 
defined to satisfy 

aj|0) 2 0 Vi. (10.49) 


In a curved spacetime with metric guv, the field equation becomes g^" $.,,, = 0 
with the semi-colon indicating covariant differentiation. However, in general, 
positive and negative frequencies have no invariant meaning in a curved spacetime 
and the expansion of $ in annihilation and creation operators is not defined. In a 
region of spacetime which was flat or asymptotically flat such an expansion can 
be made, but if we have a spacetime with an initial flat region (1), followed by a 
region of curvature (2), and then another flat region (3), the initial vacuum |0;) 
will not be the same as the final vacuum |03). This will lead to the interpretation 
that the time-dependent metric in (2) has led to the creation of a number of 
particles of the scalar field @. This is what happens in the core of a black hole [8], 
hidden from outside observers by the event horizon. When the radius of curvature 
of spacetime is smaller than the Compton wavelength of a given species, there is 
an indeterminacy in the particle number, that is to say, particle creation. Although 
these effects are negligible locally, they can have a significant influence on the 
black hole over the lifetime of the universe. 
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The most elegant derivation of the quantum result utilizes Feynman’s path- 
integral formulation of quantum field theory [9]. In this, the generating function 
W[J] for the Green functions of the quantum field theory is a functional integral 
over classical fields $ 


W[J] = f Poespiiste. J]) (10.50) 


where 


S{¢, JV = J dx [L(ġ, 8,0) + JO] (10.51) 


is the action integral with £ the Lagrangian density, and J a source current. 
In the present context, the functional integral is over both matter fields @ and 
metrics guv. The latter should include both metrics that can be continuously 
deformed to the flat metric as well as homotopically disconnected metrics such 
as those of black holes. The evaluation of the action integral is problematic for 
black-hole metrics because of the spacetime singularities they contain. However, 
this difficulty can be surmounted by complexifying the metric and evaluating the 
integral over a contour that avoids the singularities [10]. 

As an example, we take the Schwarzchild metric (10.2) which, as already 
noted, has a coordinate singularity at r = 2M and a curvature singularity at 
r = 0. The former can be removed by transforming to Kruskal coordinates in 
which the line element has the form 


e" /2M 
ds? = 324? —— — (dz? — dy?) — r^ a? (10.52) 
r 
where 
22- [L r/2M 

Z.y- 6 l)e (10.53) 
YF? QM (10.54) 

y-—-z 


The singularity at r = 0 is now on the surface z? — y? = 1 but it can be avoided 
by defining a new coordinate ¢ = iz. Then the metric has the Euclidean form 


—r/2M 
—ds? = 322 — (dt? + dy?) +r? ag) (10.55) 
where now 
+y = (Ga - De (10.56) 
vl n = e!/%M, (10.57) 


Thus, on the contour where 7 and y are real, r is real andr > 2M. Further, on this 
contour we define an imaginary time t by r = it and then (10.57) shows that c = 
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4M arg(y + if) is an angular coordinate with period B = 82M. The functional 
integral defining the generating function should, therefore, be over matter fields 
and metrics with this periodicity in r. But this is just the partition function Z for 
a canonical ensemble of the fields at temperature T = f^! = (81M) !. The 
surface gravity « for the Schwarzchild black hole is « = (4M)~', so that the 
temperature is T = «/27 in accordance with (10.46). (For the generalization 
of this result to the Reissner-Nordstróm black hole, see exercise 7.) With this 
established, it is clear that the black hole must have the entropy given by (10.47). 
Nevertheless, it is an interesting excercise to verify this directly by evaluating the 
action. 

In order to obtain a finite result, it is necessary not only to compute the 
(Euclidean) action by integrating over the previous imaginary time coordinate t 
but also over a finite region of space. In general, for a finite region M of D- 
dimensional spacetime, the Einstein-Hilbert action must be supplemented by a 
contribution evaluated on the boundary 3M which allows variations of the metric 
that vanish on 3M but which might have non-vanishing derivatives normal to it. 
In units where Gy = 1, the action can be written as 


1 
Sig, 0] = — f. V/-gRG) dx + — Í „VTB 010.58) 


where hap is the induced metric on 3M 


ax" Ax” 


= ag gee SHY (10.59) 


ab 
£^ are D — | coordinates on 0M, which is specified by an equation of the form 
f(x" (£^)) = 0. Up to a term C that depends only on the induced metric hap, 
B = K +C is just the trace K of the extrinsic curvature (the second fundamental 
form) Kgs of the boundary 8M: 


ax" ax” 


Kab = apa gpm 


(10.60) 
where n, is the unit outgoing normal to 3M and the semi-colon denotes a 
covariant derivative. Then 


av 9f af 


-1/2 af 
àxkox"| — x^ 


. 10.6 
axl ( D 


ny = 


For asymptotically flat metrics in D = 4 dimensions, where 3M can be chosen 
to be the product of the (imaginary) time axis with a 2-sphere of large radius R, it 
is natural to choose the constant C so that the action is zero for the flat Minkowski 
space metric yyy. Then 

B = K(g) — K(). (10.62) 


Since R(g) = 0, the action for the Schwarzchild black hole derives entirely from 
the surface term in (10.58). In the case of a spherical surface, f(x) =r — R —0 
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and we may use the remaining coordinates for the £^. Then the unit outgoing 
normal is 


nt = JT (10.63) 

rr 

and 19 
Kas = zn n: (10.64) 


For the Schwarzchild metric (10.2), this gives (exercise 8) 


2R — 3M 2M A!Z 
K = d —h = =i ] BEEN R? : 0 
O= aaam "50v i( R sin 
(10.65) 
and then 
2M A72 
f /—h[K (g) — K(n)| dx 2 —i4nB| 2R — 3M — 2R ( - :3 . 
8.M 
(10.66) 
Thus, in the limit R — oo, the Schwarzchild action is 
, 
Sigo 0] = 518M = TM. (10.67) 
K 


The general result [10] for Kerr-Newman metrics g(x w) of the form (10.35) 
is that the action integral has the value 


Slack), 0] = =M - 940). (10.68) 


(The rotation does not affect the evaluation of the action.) The dominant 
contribution to the path integral (10.50) comes from fields (in our case metrics 
g) with the correct periodicity that minimize the action. Such fields are solutions 
of the classica] equations of motion and, in the present context, are the Kerr- 
Newman metrics g(x y). Thus, 


In W[0] = In Z ^ iS[gx n)» 0]. (10.69) 


In a thermodynamic system, the partition function Z for a grand canonical 
ensemble at temperature T = 8^! with chemical potentials sz; associated with 
conserved charges Nj is defined as 


z = trexp| - 6(# - Duin) | (10.70) 
i 
and its logarithm is related to the free energy F by 


nz = pF = ~p(E-TS- Y uN); (10.71) 
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For the case we are considering in which B = 27/«, this gives 
2x be 
InZ = -—(M - T$-94Q-QnJ) - ——(M - Qon) (10.72) 


using (10.68) and (10.69) for the second equation. Thus 
5M =TS4+5OnO4+QuHJ. (10.73) 
From Smarr's formula (10.43), we also have that 


l 
luz 


1 
-È . . 
5 a A" t5 nQT QOgyJ (10.74) 


Then, comparing the two expressions, restoring the factors of h, c, Gy and kg, 
and using 


dk 0 he? 
~ 2nkg | 81GuMkg 
we deduce that the entropy is given by the Bekenstein-Hawking formula 
_ cA H 
^ 4Gyh 


(10.75) 


as anticipated in (10.47). 

It is worthwhile pausing for a moment to reflect upon this extraordinary 
result. We are accustomed to extensive quantities, such as the entropy of a 
thermodynamic system, being proportional to the volume of the system. Yet here 
the entropy is scaling as the surface area. The result generalizes to spacetimes 
with dimension D as 


M (10.76) 


4Gph 
where Ap is the (D — 2)-dimensional ‘area’ of the event horizon and Gp is 
the D-dimensional Newton constant. The entropy is thus essentially the horizon 
area measured in Planck units. This area dependence is an example of the 
*holographic' principle, and suggests that the fundamental degrees of freedom 
describing the system may be characterized by a quantum field theory with one 
fewer space dimensions and with an ultraviolet cut-off at the Planck scale [11]. 
We shall not pursue this intriguing suggestion further. 

The identification of the Bekenstein-Hawking entropy with the physical 
entropy of the black hole leads to two important puzzles. The first is the so- 
called ‘information problem’. Hawking [12] showed that the outgoing radiation 
from the radiating black hole is purely thermal and depends only on the conserved 
charges coupled to long-range fields. This clearly entails a loss of information, 
since two different, macroscopic objects having the same mass, a graduate student 
and her supervisor, for example, falling into the black hole would, according 
to an observer outside of the horizon, generate the same Hawking radiation. 
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String theory provides a unitary, quantum theory of gravity, so information cannot 
be lost. Thus, the information loss entailed in Hawking’s derivation must be 
an artefact of the semi-classical approximation used. However, it is not clear 
precisely where or how the approximation does breakdown nor, if it does, how 
the information is returned. We shall not pursue this topic further either. Instead, 
we turn to the second puzzle thrown up by the Bekenstein-Hawking result and 
the one on which string theory has been able to shed some light, namely whether 
there is an explanation of black-hole entropy in terms of microstates. 


10.4 Perturbative microstates in string theory 


Thermodynamics is only an approximation to a more fundamental description 
based on the statistical properties of the microstates of the system. So the 
fact that black holes have entropy suggests that this too should be understood 
microscopically. Roughly speaking, a thermodynamic system with entropy S is 
associated with a number e? of microstates of the system. For the Schwarzchild 
black hole, with a horizon area given by (10.8), we see from (10.75) that the 
entropy is Senh = 4r Gy M?/ħc. For the moment, the only important feature is 
that the associated number of states grows like e^ > We might have hoped that 
this approximates the number of perturbative string states with mass M. However, 
this is not the case, as we shall now demonstrate. 

It suffices to consider the open bosonic string. This has mode expansion [13] 


1 
X^ = x" plc + ils at cos(na) (10.77) 
n#0 n 
where the oscillator coefficients œ% are creation (annihilation) operators for 
n <0 (n > O). In units where the string length scale |; = 1/4zT is 1 (T is 
the string tension), the mass eigenvalues are given by the eigenvalues of 
lM?-N-1 (10.78) 
where 


N - Yo! od (10.79) 


is the number operator; the sum over i is over the Dr = 24 transverse dimensions 
of the bosonic string. We wish to estimate the number of (degenerate) states dn 
that have number-eigenvalue n. It is convenient to define a generating function 


oo 
G(w) = trw = aw (10.80) 
n=0 
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with |w] < 1. Now 
r3 i gi 
rw” = I] tr w*- 0» = 
mz-l 


oo 1 24 
[I (=) . (10.81) 


The function 


oo 
fw) s [[a -w» (10.82) 
m=1 
can be written as follows: 
oo oo mp oo p 
L Dm wP oo w 
nfw=J Inu -w=- Y^ > ea) Tya (1083) 
m-i m,p-l pz 


When w ~ 1, we can expand w? = | + p(w — 1) +--+, so we can approximate 
In f (w) by 


-1 <= 2 n? 
| x p2- T 10.84 
n f(w) pr» Sv) (10.84) 
Thus, 
2 
G(w) ~ exp ( a ;) when w ~ 1. (10.85) 


The required degeneracy dy may be obtained from G (w) by performing a contour 
integral 


d, = — Q ——dwv (10.86) 
where C is a closed loop around w = 0. The integrand vanishes rapidly as w — 1 


and, when n is large, w+! is small near w = 0. Thus, for large n, there is a 
saddle point near w = 1. In fact, the integrand is stationary when 


2x 
l-wc- ~ -—Inw. 10.87 
Jn4 1 ( ) 
It follows that 
d, x exp(4n yn) — asn — oo. (10.88) 


As a function of the mass M given in (10.78), the number of states o(M), 
therefore, increases as 
p(M) « exp(V/2z M) (10.89) 


quite inadequate for the black-hole entropy requirement that the number of states 
increases as 
Pen (M) e exp(4z Gy M?). (10.90) 
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Nevertheless, using a similar treatment, we shall see later that string theory can 
account for the entropy but the associated microstates are non-perturbative. Even 
then, the accounting has so far only been successful for certain special types of 
black hole called ‘extreme’ black holes. We therefore first describe the features of 
extreme black holes that are important for understanding the microscopic origin 
of their entropy. 


10.5 Extreme black holes 


It might, in any case, be objected that associating perturbative string states with 
black holes is absurd. The former are obtained by quantizing the string in 
a flat background spacetime. How could they be equivalent to a black hole? 
Nevertheless, certain perturbative states can be associated with ‘extreme’ black 
holes. We have already noted the constraint M > |Q| for the Reissner- 
Nordstróm (RN) metrics (10.33). Metrics saturating this constraint are called 
*extreme' (RN) black holes. The usual (supersymmetric) perturbative states in 
string theory satisfy M > |Q| and states saturating this inequality are called ‘BPS 
states’ after Bogomolnyi [14] and Prasad and Sommerfield [15]. They have the 
crucial property that their mass cannot receive quantum corrections and it is this 
that allows their association with extreme black holes. As the string coupling 
strength g, increases, the mass M of the perturbative state is unaltered, since 
it is independent of gs classically, and because of supersymmetry, there are no 
quantum corrections. However, the gravitational field of the state is determined by 
GN M (in four dimensions) and Gy is proportional to g2. Thus, as gs increases, 
the gravitational field increases, there is a back reaction on the perturbative state 
and, eventually, it may be described by a curved spacetime with large curvature. 
Thus, in principle at least, it might be possible to associate extreme black-hole 
spacetimes with perturbative states. 

The RN black hole looks like a natural starting place in the search for black 
holes in string theory. We can think of the Einstein-Maxwell action (10.31) as the 
bosonic part of the M = 2 supergravity theory in four dimensions. The (massless) 
gravity supermultiplet contains the graviton, two (fermionic) gravitinos and a 
vector boson called the ‘graviphoton’. The supersymmetry algebra is [13] 


(04, Qj = 2850; P, 
(Q4. OF} = 2e45Z2^? (10.91) 


where QA, with o. = 1,2 a (Weyl) spinor index and A = 1,2, are the 
two supersymmetry generators; the Qs are defined by Q4; = (QA); the 
2 x 2 matrices o^ with u = 0,1,2,3 are defined by o^. = (12,0), with 
a! (i = 1,2,3) the standard Pauli matrices; €ap and the ‘central charge’ zAB 
are antisymmetric with €12 = +1 and Z 1? = Z and, without loss of generality, 
we may choose Z > 0. (The graviphoton is a U(1) gauge boson coupled to a 
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charge which is, in fact, the central charge.) For massive representations, we may 
work in the rest frame where P, = (M, 0). Then it is easy to see (exercise 9) that 
we may form two linear combinations ay and bg of the generators that satisfy 


(es. a]] = 4(M + Z)8ap 
(ba, bj} = 4(M — Z)ôap 


(as, bf} = 0 = (au, bg) (10.92) 


Up to a normalization factor, these are just the anticommutation relations obeyed 
by two independent sets of fermion annihilation and creation operators. In 
general, starting with a state |V/) that is annihilated by ag and bg, we can 
construct a total of 16 states using the creation operators af and bł. Since 
(daa, a19) > 0 for any state |) and, similarly, for bg, the ‘BPS bound’ 


M>|Z| (10.93) 


follows. The inequality is saturated by representations (BPS states) for which |y) 
is also annihilated by one set of creation operators (oi if Z > 0). Thus, such states 
are ‘short’ massive representations of the supersymmetry algebra, since they are 
constructed using only the a} creation operators. They are invariant under half of 
the supersymmetry algebra. 

The extreme RN black hole, obtained from (10.33) and (10.34) by setting 
M = Q, is part of such a short hypermultiplet [16]. In this case, the two horizons 
are both at r = M = Q and, defining p = r ~ Q, we may write the solution in the 
‘isotropic” form in which the spatial part of the metric is conformal to flat space: 


ds? = H7? dr? — H? (dp? + p? dQ2) (10.94) 
A-(1- H^')dt (10.95) 
where 
_ Q 
Helen (10.96) 


Both the temporal and spatial *warp' factors (the factors multiplying the two 
parts of the metric), as well as the electromagnetic vector-potential 1-form, are 
determined by a single (harmonic) function H. Extreme solutions have the 
important property that they are easily generalized to a case representing N 
extreme black holes with charges q; = m; (with i = 1,2,..., N) by replacing 
the function H given in (10.96) by 


N 
gi 
H= . . 
1+ 2- Ir —ril (10.97) 


By Gauss' law, the total charge is Q — YN qi which, by the BPS bound, is also 


the total mass 
N N 
M= m=) qi - Q. (10.98) 
i=l i=l 
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There is, thus, no binding energy between the individual black holes: the 
gravitational binding is precisely cancelled by the electrostatic repulsion. Near 
the horizon |p| «& Q, the metric is approximated by AdS2 x S? (exercise 11). 
Thus, the extreme black hole may also be regarded as a soliton that interpolates 
between the Minkowski vacuum when p >» Q and AdS2 x S? when |p| « Q. 
Generalizations of this special form, involving extra dimensions, extended objects 
and other charges, are important in what follows. 

In particular, the simplest example of a string-theory black hole for which a 
microscopic description can be found is provided by a five-dimensional analogue 
of the RN solution (10.33) and (10.34). The static, solution of the five- 
dimensional Einstein-Max well equations outside of a ($*)-spherically symmetric 
body of mass M and charge Q is 


-1 
2M 2 2M 2 
ds? = -M2 dt? — -M,L dr? — r? dQ? (10.99) 
r2 r4 r2 r4 3 


dt. (10.100) 


(The r7? dependence of the potential arises, of course, because there are now 
four spatial dimensions.) As before, the extreme case where M — Q, with the 
horizon at r = yQ, may be cast in an isotropic form by using the coordinate 


p = Vr? — Q. Then the horizon is at p = 0 and 
ds? = H~? dr? — H (dg? + p? do?) (10.101) 
A-(1— H^'dt (10.102) 


where now the harmonic function is 
H=1+ 2. (10.103) 
p 


To connect this black hole with string theory, we need to consider the 
effective field theory describing string theory in the low-energy limit. This is 
a generalization of Einstein-Maxwell theory both with respect to the number of 
spacetime dimensions (ten) and the fields involved. The fields involved are just 
the massless modes that arise in (perturbative) string theory and the field theory 
that desribes them is type II supergravity. 


10.6 Type II supergravity 


The massless states that arise in superstring theory include both bosons and 
fermions. Black holes are, of course, solutions of the classical bosonic field 
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equations, so we shall only be concerned with the bosonic degrees of freedom. 
All closed superstring theories have massless (bosonic) modes associated with the 
graviton field G „y (a symmetric, traceless, rank-2 tensor), the Kalb-Ramond field 
Buv (an antisymmetric, rank-2 tensor) and the dilaton field $. (The expectation 
value of the dilaton fixes the string coupling constant gs via gs = (e9).) In 
type II superstring theories, these states arise in the NS-NS sector, i.e. they 
are states which are constructed using the Neveu-Schwarz (half-integer-moded) 
world-sheet fermion creation operators for both left- and right-movers. Massless 
bosonic states also arise in the R-R sector: these are constructed using the 
Ramond (integer-moded) world-sheet fermion creation operators for both left- 
and right-movers. In type IIA (in the light-cone gauge), the R-R states transform 
as [1] = 8, and [3] = 56 representations of the (transverse) SO(8) group, 
where [n] denotes the totally antisymmetric rank n tensor. They may, therefore, 
conveniently be represented by form fields C(1; and C(3) where 


1 
Cony = Cui. s dx ^ dx? A LLLA datn, (10.104) 


The representations [n] and [8 — n] of SO(8) are the same, since they related 
by the eight-dimensional e-tensor, so we could as well represent these fields by 
the forms Con) and C(s, respectively. The tree-level Weyl invariance of the string 
world-sheet action is preserved in the quantum string theory provided that the 
(renormalization-group) beta functions associated with these fields all vanish. The 
resulting equations amount to spacetime field equations for the background fields 
that would arise from the effective action: 


1 - 
SpA — zz f dx (-G)!2e ??R(G) 
2k19 
+ fiae ^ *d$ — 1 Ho) ^*H) — } Fe) A * Fay 


— 5 Fay ^ * Fay — 5 By) ^ Fay A Fel] 
(10.105) 


where G = det[G pv], 

2x1 = Qn)y'o'^g? (10.106) 
is related to the ten-dimensional Newton constant by 2x2, = l6zGio; a’ is 
related to the string tension T by a’ = x = i2 (where /, is the string length 
scale); the dilaton 1-form is df = 8, dx", and the field-strength forms are 
related to the potentials by 


H3) = dB) Fo) = dC) Fa = dC) 
Fa = Fa + Cay A Ho) (10.107) 
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By) is the 2-form associated with the Kalb-Ramond field and the (10- 
dimensional) Hodge dual *C,p) is 


-G 1/2 
Cip) = X0 jen moa MEE dx" a dx”? A... Adx"9-r, 
(10.108) 
It is important to bear in mind that the effective action (10.105) is an 
approximation that is good in the low-energy limit o^ — 0. Higher-order terms 
in the curvature are negligible provided R(G)a’ « 1. Roughly speaking, we 
may say that the metric obtained by solving the lowest-order field equations is 
only well defined on length scales / >> /,. Note that all terms from the NS- 
NS sector are multiplied by e^?? , while terms from the R-R sector are not 
coupled to the dilaton. This is a feature of the ‘string frame’ in which the action 
(10.105) is written. To remove the dilaton factor from the curvature term, as in 
the conventional ‘Einstein frame’ in which we have worked hitherto, we perform 
the following field redefinition 


Guy = e?" guy. (10.109) 


Then the effective bosonic action in the Einstein frame is 


I ] _ * 
Sia = zl f dx (—g)! 2 R(g) — 5 f [d$ ^ "dé + e * } Ho) A " Hoy 
10 


e? Foy A * Foy + e Fa ^ * Fay + Bo ^ Fa ^ Foy] 
(10.110) 


Type II string theory has M = 2 supersymmetry which, in 10 dimensions, 
is realized by two Majorana- Weyl spinors Qo and Qs, each having have 16 real 
components, so there is a total of 32 supersymmetry charges; Q acts on the right- 
movers, and Ó on the left-movers. In type IIA theory, the two spinors have 
opposite chirality, while type IIB both spinors have the same chirality. Thus, 
type HB is a chiral theory and type IIA non-chiral. The R-R states in type IIB are 
also represented by form fields but now with components transforming as even- 
ranked tensors C(o, C2) and C(aj—the only subtlety is that the [4]. = 35 of 
SO(8) is self-dual. Analogous forms of the actions (10.105) and (10.110) may 
also be written in terms of the field strengths F1), Fa) and F(5) derived from 
these R-R sector fields. Otherwise, the structure of the two forms of the action 
is very similar to the type IIA case and we shall not reproduce them here. The 
self-duality constraint (of the 5-form field strength) must be applied as an extra 
condition on the solution of the field equations. The important point is that we 
may consistently truncate the type ITA and type IIB effective actions to include 
only the graviton, dilaton plus one field-strength tensor Fin) (or H(3)). This is 
non-trivial because it must be verified that the (local) supersymmetry variation of 
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the (zero) fermion fields is constantly zero. In the Einstein frame, this gives an 
action of the form 


l 1 - 
Sn = zl J d? (— g)?R(g) — = | [dg ^ *d$ + 0-7 Fin) A “Foll. 
2Ki0 2 


(10.111) 
The value of a determines the coupling to the dilaton and may be read off from 
the type IIA or IIB action. If the chosen field strength is H(3) deriving from 
the NS-NS form Baz), then a = 1, whereas if the chosen field strength derives 
from an R-R field, then a = (n — 5)/2. In the latter case, n = 2,4,... 
corresponds to type IIA, and n = 1,3, 5, ... corresponds to type IIB. The fact that 
the type II string theory effective action (10.110) involves various field strengths 
Fin) suggests that there should be some objects in the underlying string theory 
that couple directly to the associated gauge form fields, just as an electron is 
coupled by its charge to the Maxwell gauge potential A,. We shall see in the 
next section that these objects are extended objects, p-branes, generally having p 
spatial dimensions. 


10.7 Form fields and D-branes 


There is a geometric aspect of the antisymmetric forms which gives important 
insights. A gauge field A, is coupled naturally to the world line X” (r) of a 
charged particle by a term in the action of the form 


dx? 


Under a (U(1)) gauge transformation, the vector potential 1-form 
A, dx" = Aq) > Am t dA (10.113) 


where Ao, is a O-form, i.e. a function. The field strength 2-form Fo = dA 1) is 
gauge invariant and satisfies 
dF) = 0 (10.114) 


which in four dimensions are two of Maxwell’s equations. The other two are 
d* Fay = *Jay (10.115) 


where Ja) = J, dx* is the current 1-form. * F(2) and * J) are the Hodge duals. 
They are defined in ten dimensions in (10.108) but have an obvious generalization 
to any dimensionality. In four dimensions, *Ji is a 3-form, and we may use 
Gauss’ theorem to find the electric charge Q in some spatial volume V3 enclosed 


by a surface So: 
o= w= f *F. (10.116) 
V3 52 
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The string world-sheet X“ (rt, o) has an analogous coupling to the Kalb-Ramond 
field Buy: 


S~ f 5, età xt Pe (10.117) 


where &,) are the two world-sheet coordinates (t, a) respectively, €?Ê is the 
antisymmetric tensor with €?! = —e!? = —1. The gauge transformation 


B > Bay t dA) (10.118) 


leaves the 3-form field strength H(; = dB» invariant and, analogously to 
(10.114), 
dH) = 0 (10.119) 


Evidently, the string, and (some of) its excitations, have a non-zero value 
of the NS-NS ‘electric’ charge associated with the B,, gauge field. The 
generalization of (10.116) is that the electric charge Q of the (one-dimensional) 
string, associated with the 2-form gauge potential that is enclosed by the seven- 
dimensional hypersurface 57, is given by 


Qi =f * Ho). (10.120) 
S7 


For a general antisymmetric (p + 1)-dimensional tensor gauge field, the 
generalization of (10.112) and (10.117) is a term in the action of the form 


s ~J C1; i; pii €P Bag XP a, XH? ap XPott dP +E. (10.121) 


This describes the coupling of the C(p41) form gauge field to the (p + 1)- 
dimensional world volume of an extended object having p spatial dimensions 
(a p-brane) that has a non-zero value for the NS-NS or R-R electric charge 
associated with the NS-NS or R-R sector gauge form fields described in the 
previous section. Under a gauge transformation, a gauge form field transforms 
as 

Cops) > Cip+i) t dA(p (10.122) 


and the field strength F(542, = dC(p41) is invariant. The electric charge of the 
p-brane associated with the gauge potential, that is enclosed by the hypersurface 
Sg—p is, therefore, generally given by 


Qp =f * Fipe2)- (10.123) 
Sg- 


In addition to the field strength H(3, deriving from the Kalb-Ramond NS- 
NS form field Bọ), which we have already noted is coupled ‘electrically’ to 
the string world sheet, the effective action for type ITA superstrings (10.105) 
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or (10.110) also involves the field strengths Fin) (with n even) that derive from 
the R-R fields C(41). However, unlike the NS-NS field Bo), these fields are 
not coupled electrically to the string world sheet or its excitations. In fact, the 
vertex operator for the emission of an R-R state involves the corresponding field 
strength rather than the gauge field. It therefore vanishes at zero momentum and, 
in consequence, the pertubative string states are electrically neutral with respect 
to the charge associated with the R-R gauge fields. The foregoing discussion 
suggests that the R-R field strengths F(,) are naturally associated with branes 
having p = n —2 dimensions. Thus, type ITA superstring theory is also associated 
with O-branes (i.e. point particles), 2-branes (membranes) and 4-branes having 
non-zero values of the associated R-R charges. Similarly, type IIB superstring 
theory is associated with 1-branes (R-R strings), 3-branes and 5-branes. Since 
the perturbative string states are neutral with respect to the charges associated 
with the R-R fields, type II (A or B) string theory has to be augmented with 
non-perturbative dynamical objects (p-branes) that do have R-R charges, i.e. 
electric charges associated with the R-R gauge form fields. The branes are called 
‘Dirichlet p-branes' or 'Dp-branes', because besides the closed-string sector, 
there has to be an open-string sector in which the open string ends on these 
p-dimensional hyperplanes [17, 18]. Hence, the open-string world sheet has 
Dirichlet boundary conditions in the directions perpendicular to the branes. It 
turns out that these D p-branes do couple electrically to the associated (closed- 
string) R-R fields, just as the fundamental string is coupled electrically to the 
NS-NS Kalb-Ramond field B,,. (More precisely, D-branes act as a source for 
the associated R-R gauge fields.) It is this enlargement of (the theory formerly 
known as) string theory that has led to the understanding of black-hole entropy in 
terms of the associated microstates. We now turn to the construction of the explicit 
black-hole solutions of the type II supergravity field equations whose entropy we 
shall eventually be able to explain in terms of D p-branes. 


10.8 Black holes in string theory 


As we have just noted, besides fluctuations of the string, string theory also 
has various non-perturbative solitons. These are static, finite-energy solutions 
of the classical field equations, just as RN black holes are static finite-energy 
solutions of the classical Einstein-Max well field equations. It follows from the 
previous discussion that a (“black brane’) solution of the field equations deriving 
from the action (10.111) for gy», with non-zero Fin), will give the gravitational 
field associated with an (n — 2)-brane having the associated NS-NS or R-R 
charge. The field equations may be simplified by looking for solutions that have 
Poincaré invariance in the p + 1 = n — 1 dimensions associated with the p-brane 
world volume and rotational invariance in the remaining transverse directions. 
The coordinates x^ are, therefore, split into longitudinal ones, denoted x^ with 
a = 0,1,..., p, and transverse ones, denoted y! with i = (p + 1),...,9. The 
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metric is then assumed to have the ‘warped’ form reminiscent of that encountered 
earlier in (10.94) and (10.101): 


ds? = e^ O nap dx? dx? — e?85,; dy! dy? (10.124) 


where A and B are functions of the radial coordinate r = y yiyi. When the p 


spatial dimensions x!,...,x? are compactified the metric could then describe a 


higher-dimensional black hole. For the dilaton, the ansatz is 
$ - f(r) (10.125) 
and, for the antisymmetric tensor, the assumption is 
Coi2...p = eC — 1. (10.126) 


Then it turns out [19] that the classical field equations following from (10.111) 
have a solution in which all of the functions A, B, C, f are determined by a single 
harmonic function H (r): 


ds? = Hy(r) -Phap dx? dx? — Hy (r)9* 9/55, dy dy. (10.127) 


e? = py (r)8- 9/4 (10.128) 
Cis) = LHp(r) ! — 1]dxP A dx! ... A dx? (10.129) 
with 
Li? 
Hy(r) = 1+ m (10.130) 
The length L, is defined by 
T-p _ Kio [tp 3- 
LL? = NP Qna)? (10.131) 
P o0 (- pgs-p 
where qp is an integer and 
(n-1)/2 
Qu = (10.132) 


r((n + 1)/2) 
is the volume of the unit n-sphere S”. Using (10.109), the line element in the 
string frame is 
ds? = H,(r) "n4, dx? dx? — Hy (r)! 28; dy! dy? (10.133) 


with all other fields unaltered. In fact, these solutions are extreme solitons. As in 
the case of the four-and five-dimensional black-hole solutions (10.33) and (10.99), 
the mass Mp can be read off from the warp factor Hp(r). It is the coefficient of 
2x2)/ (7 — p)&s-p that plays the rôle of Newton's constant in this case. Thus, 


Mp = a5 3 Y* aea) = , ar (ave) n. (10.134) 
; 
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As expected of Dp-branes, the solitons we have found are non-perturbative 
objects whose mass diverges as gs — 0. However, the fact that the mass scales as 

-1 rather than &;? shows that they are unconventional solitons, quite unlike the 
sphalerons whose action is given in (4.172), for example. The electric charge Op 
associated with the C(5..1) form gauge potential is given in (10.123): 


1 
5p 


2k19 


Thus, the solitons have gp units of the fundamental Dp-brane R-R ‘electric’ 
charge pp = (27) P (o m 2/g;. Further, Mp = Qp, so, as claimed, these 
are extreme states. There is an exact cancellation between the attractive forces 
from NS-NS fields due to the mass of the soliton and the repulsive Coulomblike 
electrical forces from the R-R fields due to its charge. As in the four-dimensional 
case, this signals the fact that one-half of the supersymmetry charges are preserved 
by the solution, which, in this case, is invariant under the 16 supersymmetry 
charges - 

Qa + PQo (10.136) 


where Q and Q act, respectively, on the right- and left-movers and P represents 
the operator that reflects the directions y! transverse to the brane. 

The first question then is: do any of these solutions give us the metric of 
a black hole? If, with the benefit of hindsight, we try to obtain the extreme 
five-dimensional RN solution given in (10.101), (10.102) and (10.103), then the 
obvious first attempt is to choose p — 5. (We are, therefore, considering type IIB 
superstring theory, since p is odd.) This gives four (transverse) spatial dimensions 

6 ,7 18 9 
EDAD sys 
= 09? + 0? + o5? + O° (10.137) 

and the harmonic function Hs(r) varies as r~2. As previously noted, if the five 
longitudinal coordinates x!, x?, x3, x4, x5 are compactified (on a 5-torus T? say), 
the metric resembles the five-dimensional RN black hole (10.101). However, the 
warp factors in both the longitudinal and transverse directions are wrong and, 
further, the dilaton is singular on the event horizon at r = ry = 0. (The 
quantum states associated with a soliton are found by identifying the zero modes 
(or collective coordinates) and quantizing them. This is not possible if the soliton 
is singular.) Now, because the solitons are extreme, we may combine solutions 
with different values of p, provided that (some) supersymmetry is preserved. 
This requires that some of the supersymmetry charges (10.136) preserved by one 
solution are also preserved by the other. Thus, if Pj and P» represent, respectively, 
reflection of the coordinates transverse to the pi- and p2-solitons, the preserved 
supersymmetry charges satisfy 


Qa + PiQa = Qa + PO. = Qa + POL! Pr) Ou (10.138) 


and we see that the unbroken supersymmetries correspond to +1 eigenvalues of 
P ! Py. In general, this requires that the number of directions that are transverse 
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to one brane and parallel to the other is a multiple of four and that a (p — 2)-brane 
can lie within a p-brane [20]. In the present context, therefore, we may combine 
the p = 5 solitons described earlier with the p = 1 solitons, which we take 
to have longitudinal coordinates x, x!—the remaining four x? coordinates are 
transverse to the 1-brane but parallel to the 5-brane. Then again just half of the 
supersymmetry charges are preserved leaving just eight, corresponding to M = 2 
supersymmetry in four dimensions. 
In the first instance, the harmonic function H is a function of 7 where 


P = o9 -o on? (10.139) 


and varies as 7^9. However, when the four directions x?, x3, x$, x5 are 
compactified on T* c T? of volume V4, this has the effect of *smearing' the 
DI-branes. For example, compactifying the coordinate x? on a circle of radius 
R, the system of D1-branes is replaced by an infinite array of parallel branes 
a distance 27 R apart. Because they are extreme states, the effective harmonic 
function Hi for the array is easily written down and, for r >> R, this function 
varies as [F? — (x2)?]757, as if the DI-branes also wrapped x. The effective 
number of D2-branes is aal / R. This result obviously generalizes to the case 
in which all four coordinates are compactified on T^. Then Hy varies as r~, just 
like Hs(r), and is given by (10.130) with p = 5 but with the effective number qs 


of D5-branes given by 

. Qx Vant 

qs = Ya qı 

The composite system has warp factors that are given by the ‘harmonic 

function sum rule’ [21]. With the proviso that the harmonic function H; for 
the D1-branes is replaced by the smeared version Hi just described, so that both 
of the harmonic functions are functions of the overall transverse distance r, the 
warp factors of the composite system are just the product of the separate warp 
factors in each of the three sectors, namely parallel to both, transverse to both and 


transverse to the 1-brane but parallel to the 5-brane. The dilaton is also given by 
the product. Thus, for the combined system, 


(10.140) 


ds? = H "ÄT IAO} — (dx?) — ng "B" pax? +- + (di5?] 


_ He! Äi tay$? Ree "ne (10.141) 

ef = Hy? AI? (10.142) 
Cos) = [H5  — 1]dx? A dal... Adi? — Cay = (Ay! — de® A dx'. 

(10.143) 


Then the dilaton is, as required, finite at the event horizon. (We note, incidentally, 
that this is only possible when at least two 'charges' are activated.) In addition, 
the warp factor associated with the y! directions now approximates the H warping 
in (10.101). Thus, ignoring the directions x?, x3, x4, x? for the moment, all 
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that is needed to complete the resemblance to (10.101) is a further warping 
approximating H^! in the x9-direction. In any case, in its present form, 
the metric does not have a finite horizon ‘area’, because g}; vanishes at the 
horizon: it requires a further warping approximating H. In fact, the missing 
ingredient is supplied by adding momentum in the x  -direction. Since the extreme 
solution is boost invariant, we cannot add momentum simply by boosting it. 
The generalization arises [21] because the solutions have a null-hypersurface 
orthogonal isometry which permits the replacement 


(dx)? — (dx!)? > (de)? — (dx)? + Hp (ry (dx? — dx')? (10.144) 


for an arbitrary harmonic function Hp(r). This is exactly what happens if we 
do a Kaluza-Klein reduction along the x!-direction but with a non-zero off- 
diagonal component in the metric g91/g11 = H p(r)^! (see exercise 7). Indeed, 
the resulting metric can be obtained from a non-extreme metric by this means: 
the dilaton and gauge potential form fields are unaffected. The final form of the 
metric is 
ds? = du, ^ Ay (ax)? m (dx? + Hp(dx® — dx!)?] 
- Hs ÀI (dx)? + + (d35y?] — H5" ldr? + 7.423). 


(10.145) 
We may write the harmonic functions in the form 
- r2 L? r? 
My=1+4 Hs=1+ 3 Hp = $ (10.146) 
r r 
where, using (10.106), (10.131) and (10.140), 
2 (20 at 2 r2 = ngla Qz a) a 
ri = 418,0 —,.n 5-7 q5850' = ng;a' -n E 
(10.147) 


with R the radius of the circle upon which the x!-coordinate is compactified, and 
n an integer specifying the (right-moving) momentum P = n/R on the circle. 
Adding this momentum breaks a further half of the supersymmetries leaving a 
total of four preserved supersymmetry charges. This corresponds to N = | 
supersymmetry in five dimensions. The ‘area’ Ay of the event horizon is defined 
as the (eight-dimensional) volume of the time slice at the horizon. Taking the 
limit r — 0 from above, this gives a product of factors, one from each of the 
three disjoint pieces of the metric: 


Au = (tr, L] "rpn Riri L y vago" 13/4) 282} 


= Am? RVariLsrp. (10.148) 


The surface gravity and, hence, the black-hole temperature is zero, as might be 
expected from an extreme black hole (see exercise 7). Using the (generalized) 
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Bekenstein-Hawking formula (10.75), the entropy Sp, associated with this 
solution is 

Son = AH = 2n J/diqsn (10.149) 

4G10 

independently of the size of the compact dimensions and of the string coupling 
85. Thus, the entropy is determined entirely by the integers q1, q5 and n and it is 
this feature that allows the identification of the associated microstates. The fact 
that the entropy of a black hole at zero temperature is non-zero does not imply 
a violation of the third law of thermodymamics, if indeed the analogy between 
black-hole dynamics and thermodynamics is exact. The version of the third law 
that suggests that there is, is a statement about the equations of state for ordinary 
matter. 


10.9 Counting the microstates 


We have already argued that Dp-branes are sources of R-R charge and we have 
also shown in (10.135) that the soliton solution (10.129) has qp units of p-brane 
R-R charge. Thus, the obvious interpretation of the black-hole solution that we 
have just constructed is that it is a bound state made out of qs D5-branes and 
qi DI-branes (D-strings) with some momentum n/R. However, D p-branes are 
defined as pointlike objects in their transverse dimensions in an otherwise flat 
spacetime. The R-R solitons that we have derived are only asymptotically flat, so 
why do we believe that they are made of D-branes? We have noted previously that 
the effective action from which these solitons derive is a good approximation so 
long as the curvature is small R(G)o" « 1. The length scale defined in (10.131), 
associated with the solution (10.129), is given by Lp ~ (sap)! C-P Ja’. Thus, 
when g;qp > 1, the curvature is small and the soliton solutions are valid. In 
fact, the effective supergravity equations were derived using string perturbation 
theory, which is valid only when gs < I. Consequently the soliton solutions 
apply only when q, is large, so that the curvature is small. When the string 
coupling 2; is very small, the R-R solitons are very massive, as is apparent from 
(10.134). However, their gravitational field is proportional to GioMp and, since 
Gio X g?, the associated spacetime becomes flat as g, — 0. Also, the horizon 
area Ay = 4G10Sph approaches zero in this limit. When it is smaller than the 
string scale P = 4n7q’, the higher-order curvature terms become important 
and the flat space description becomes valid. Provided that gsqı «& 1 and that 
8sQ5 < 1, we are considering weakly-coupled D-branes in a flat spacetime. In 
this case, it is straightforward to count the number of configurations. We shall 
return shortly to the case where g,qp > 1 in which our black-hole solutions are 
valid. 

The configuration in question (10.145) breaks the 10-dimensional Lorentz 
symmetry SO(1,9) — SO(1, 1) x SO(4)| x SO(4),. The first factor acts on 
the D-string world sheet (x, x!), the second factor on the rest of the D5-brane 
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world Volume (x?, x3, x^, x5), and the final factor on the remining dimensions 
(yÉ, y^, y$, y 9) that are transverse to both. This symmetry forbids rigid branes 
from carrying linear or angular momentum. So the question arises as to what 
degrees of freedom do carry the momentum n/R. An obvious possibility is the 
massless states of the open strings that begin and end on D-branes. (The massive 
excitations of the D-branes have masses proportional to g7 ! and, hence, do not 
play a role when the coupling is weak.) The 1-1 states, in which the open string 
begins on a D1-brane and ends on a D1-brane, generate a vector supermultiplet in 
the adjoint representation of the U (q1) gauge group, similarly for the 5-5 states 
with gauge group U (qs). In geometrical terms, the VEVs of the scalar fields 
in these supermultiplets correspond to separations of the individual D1- and D5- 
branes from each other. This takes us away from the black-hole state, which 
has maximal degeneracy. So instead we consider the 1-5 and 5—1 states. These 
generate hypermultiplets in the (qj, Q5) - (Q4. 45) of U(q1) x U(qs), which gives 
a total of 4qıq5 (scalar) bosons and an equal number of (Weyl) fermions. The 
vacuum expectation values of the scalars are associated with the 4 coordinates 
(transverse to the D1-branes) of each D1-brane giving its position relative to each 
D5-brane. This configuration must be made to Carry P= n/ R momentum in 
the x!-direction. With the four coordinates x?, x3, x4, x * compactified on a torus 
whose size is small compared to that of the circle on which x! is compactified 
(i ^ « R), we effectively have a two-dimensional field theory on the world 
volume (x9, x!) of the D-string. The Hamiltonian is H = n/R and this 
has to be distributed among the 4gıq5 bosons and fermions. Apart from the 
(minor) complication introduced by having fermions, this is precisely the problem 
discussed in section 10.4 in which we estimated the number of (bosonic) string 
states having mass n (in string units). 

As before, the problem may be solved using a generating function (the 
partition function) 


oo 
Z(w) ztrw^ = y daw" (10.150) 


where N = PR and d, is the (required) number of states having eigenvalue n of 
N (and, therefore, right-moving momentum n/R). Then 


oo 
1+w™ 44145 
N 
tr = . . 
w I (I) (10.151) 


m=l 


The fermions give the terms in the numerator and the bosonic contribution in 
the denominator is derived precisely as in (10.81). As in section 10.4, we may 
estimate d, for large values of n (exercise 12) with the result [22] 


d, ~ exp (2x /qiqsn) . (10.152) 


Hence, 
In d, ~ 2x J/qiqsn = Son (10.153) 
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using (10.149). This is a spectacular result. We have recovered the Bekenstein— 
Hawking formula (10.149) for the entropy of the extreme black hole (10.145) 
from counting the microstates that make up this black hole. We assumed that 
vè /^ & R inorder to simplify the counting of states but this is not essential. Since 
the number of states (10.152) does not change when the radii are continuously 
varied, we know that the entropy is given by (10.153) for all values of R. Also, 
the states were counted in the limit g,q1 « 1 and gsgs « 1 where we have 
D-branes in flat space, whereas the black-hole solution (10.145) is valid only 
when gsq1 > l and gsq5 > 1 so that higher-order curvature corrections are 
negligible. However, because these are extreme solutions, they are protected by 
their supersymmetry and we may assume that the calculated degeneracy will not 
undergo renormalization by quantum effects. 

This five-dimensional RN black hole utilizes three non-zero U (1) charges— 
the two R-R charges q; and qs, and the momentum n/R in the internal x!- 
direction—and this is the minimum number needed to get a finite area with a 
regular horizon. In four dimensions, a minimum of four non-zero charges is 
needed. The result can be generalized to near-extreme black holes, in which case 
the entropy becomes a function of the mass of the black hole as well as its four 
charges. We shall not pursue this further. The interested reader is referred to one 
of the excellent reviews [23-25] in the literature. 


10.10 Problems 


1. Show that a particle on a radial timelike geodesic r = R(t) in a Schwarzchild 
spacetime falls from rest at r = R(0) > 2M to R = 0 in proper time 


r= mM (1 - 1=2MRO) ^. 


2. Verify that the vector ^ normal to the event horizon of a Schwarzchild black 
hole has components (10.11) in Eddington-Finkelstein coordinates. 

3. Verify that the Kerr-Newman metric reduces to the Reissner-Nordstróm 
metric when the angular momentum J = 0. 

4. Verify Smarr's formula and (10.44) for the Kerr-Newman metric where the 
surface gravity is x = (r4 —r—) /2( +a?) and the co-rotating electrostatic 


san: — r. 
potential is bj = cu 
5. The surface gravity « can be calculated directly using the formula 


R= —1x'^" xuvlr=ry 

where x is a timelike Killing vector normal to the horizon (and normalized so 
that x? = | at spacelike infinity) and the semi-colon indicates the covariant 
derivative. Using the metric (10.7) and the Killing vector x^ = ô}, verify 
that x is a unit timelike Killing vector and that the previous formula is 
satisfied by « — 1/4M, as required. 
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6. Calculate the Schwarzchild radius for a black hole having mass M = Mo. 
Calculate also its temperature and entropy. 

7. By choosing new coordinates p = fr —r4 and t = it, show that near 
the horizon the line element for the Reissner-Nordstróm black hole may be 
written in the form 


—ds? ~ Ap? dv? + 4A7! dp? + r° ap? (10.154) 


where A = (r4. —r_)/ r. Hence, show that t is an angular coordinate with 
period 4z A~! and, therefore, that the temperature of the extreme black hole 
is zero. 

8. Evaluate the extrinsic curvature Kap of the Schwarzchild metric on a 
spherical shell of radius R and verify equations (10.65) and (10.66). 

9. Show that the M = 2 supersymmetry algebra (10.91) can be written in the 
form (10.92). 

10. Express the function f (w) given in (10.82) in terms of the Dedekind eta 


function 
oo 
n(t) = eitt/12 [Ja — e2timty | 
mzl 
Using the property 
n(—1/t) = (-it)!/?n(z) 
show that 


—2n 4? 1 n? 4n? 
- -1/24 
f) (=) w eP | Ginw f Iw] 
Hence, show that 


2 
fw) ~ AC — w)7! exp (E) forw ^1 


and find the power-law correction to the exponential dependence given in 
(10.89). 

11. For the extreme Reissner-Nordstróm black hole with the metric (10.94), 
show that near the horizon the metric approximates Ad $2 x S?. 

12. Show that the partition function Z(w) defined in (10.150) and (10.151) has 
the asymptotic behaviour 


2 
Z(w) ~ exp (zs) forw ^1 


and, hence, verify the degeneracy (10.152) of states having momentum n/ R. 
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10.11 General references 


The books and review articles that we have found most useful in preparing this 
chapter are: 


e Hawking S W 1973 The Event Horizon Black Holes Cours de l'Ecole d'été 
de Physique Theorique, Les Houches (New York: Gordon and Breach) 1 

e Townsend P K Cambridge University lecture notes on Black Holes, arXiv:gr- 
qc/9707012 

e Horowitz G T 1996 The origin of black hole entropy in string theory Seoul 
1996, Gravitation and Cosmology 46, arXiv:gr-qc/960405 1 

e Johnson C V 2003 D-Branes (Cambridge: Cambridge University Press) 


Bibliography 


[1] Hawking S W 1971 Phys. Rev. Lett. 26 1344 
[2] Bekenstein J 1972 Lett. Nuovo Cimento 4 737 
(3) Hawking S W 1974 Nature 248 30 
[4] Newman E T and Penrose R 1962 J. Math. Phys. 3 566 
[5] Hawking S W 1973 The event horizon in black holes Cours de l'Ecole d'été de 
Physique Theorique, Les Houches (New York: Gordon and Breach) p 1 
[6] Smarr L 1973 Phys. Rev. Lett. 30 71 
Smarr L 1973 Phys. Rev. Lett. 30 521(E) 
[7] Bekenstein J 1973 Phys. Rev. D 7 2333 
Bekenstein J 1974 Phys. Rev. D 9 3392 
[8] Hawking S W 1975 Commun. Math. Phys. 43 199 
[9] See, for example, Bailin D and Love A 1986 Introduction to Gauge Field Theory 
(London: IOP) 
[10] Gibbons G W and Hawking S W 1977 Phys. Rev. D 15 2752 
[11] 't Hooft G 1990 Nucl. Phys. B 335 138 
[12] Hawking S W 1976 Phys. Rev. D 14 2460 
[13] See, for example, Bailin D and Love A 1994 Supersymmetric Gauge Field Theory 
and String Theory (Bristol: IOP) 
[14] Bogomolnyi E B 1976 Yad. Fiz. 24 861 (Engl. transl. Sov. J. Nucl. Phys. 24 449) 
[15] Prasad M and Sommerfield C 1975 Phys. Rev. Lett. 35 760 
[16] Gibbons G W and Hull C M 1982 Phys. Lett. B 109 190 
[17] Dai J, Leigh R G and Polchinski J 1989 Mod. Phys. Lett. A 4 2073 
Leigh R G 1989 Mod. Phys. Lett. A 4 2767 
(18] Polchinski J 1994 Phys. Rev. D 50 6041 
Polchinski J 1995 Phys. Rev. Lett. 75 4724 
[19] Horowitz G T and Strominger A 1991 Nucl. Phys. B 360 197 
[20] Polchinski J, Chaudhuri S and Johnson C V Notes on D-branes, arXiv:hep- 
th/9602052 
[21] Tseytlin A A 1996 Nucl. Phys. B 475 149, arXiv:hep-th/9604035 
[22] Strominger A and Vafa C 1996 Phys. Lett. B 379 99, arXiv:hep-th:9601029 
(23] Mandal G 1999 Lectures Presented at the ICTP Spring School, April 1999 arXiv:hep- 
th/0002184 


308 Black holes in string theory 


[24] Wadia S R 1999 Lectures Given at Advanced School on Supersymmetry in the 
Theories of Fields, Strings and Branes, Santiago de Compostela, Spain, 26-31 
July 1999 and at Workshop in String Theory, Allahabad, India, 18 Oct-18 Nov 
1999 and at ISFAHAN STRING SCHOOL AND WORKSHOP Isfahan, Iran, 1-14 
May 2000 arXiv:hep-th/0006190 

[25] Peet A W 1999 Theoretical Advanced Study Institute in Elementary Particle 
Physics (TASI 99) Lectures: Strings, Branes, and Gravity, Boulder pp 353-433, 
arXiv:hep-th/0008241 


Index 


A-terms, 139 
abundance, 148 
affine, 279 
Affleck-Dine mechanism, 137-142 
age of universe, 8-10 
axion, 155 
astrophysical constraints, 
159-161 
bremsstrahlung, 160, 162 
cosmology, 161—169 
dark matter, 169 
decays, 157, 158 
DFSZ or GUT, 159-161, 169 
interactions, 156 
invisible, 159 
KSVZ or hadronic, 159, 161, 169 
misalignment, 165 
Primakoff process, 160 
production, 160, 161 
relic abundance, 163 
thermal production, 161 
visible, 158 
axions 
and SN1987A, 161 


B — L conservation, 103, 104, 116 
B — L non-conservation, 111, 139 
baryogenesis, 91-142 

and inflation, 113 

in $0(10) GUT, 110-113 

in SU(5) GUT, 108-110 

in supersymmetric electroweak 

theory , 132-137 
necessary conditions, 94—96 


via Aflleck-Dine mechanism, 
137-142 
via sphalerons, 120-127 
via superheavy boson decay, 
96-99 
baryon asymmetry, 91 
value of, 93 
value of ng, 93 
value of 5, 92 
baryon number non-conservation 
in SU (5) GUT, 101, 103 
in the standard model, 114—120 
Bekenstein-Hawking formula, 284, 
288, 303, 305 
big-bang nucleosynthesis, 21—27 
black hole 
entropy, 281, 284, 288 
extreme, 291—293 
microstates, 289 
Reissner-Nordstróm, 282, 292, 
293, 300 
Schwarzchild, 277 
temperature, 284 
bounce solution, 59, 60 
BPS bound, 292 
BPS states, 291 
bubble, 130, 132-133 
bulk region, 185, 189 


chaotic inflation, 217 

chemical potentials, 136 

Chern-Simons number, 119, 123, 
126 

CMSSM, 181, 185 


309 


310 Index 


co-annihilation tail, 185, 189 
cold dark matter, 175 
comoving wavelength, 210 
conformal transformation, 280 
cosmic microwave background 
(CMBR), 21 
cosmic string, 69 
dynamics of local strings, 76 
energy density, 73 
gravitational field, 74 
local, 71 
cosmological constant, 6, 8, 10-17, 
174 
and anthropic principle, 16-17 
and supersymmetry, 15-16 
evidence for, 12 
scale-factor time dependence, 8 
value of, 15 
cosmological inflation, 195 
Cowsik-McClelland bound, 151 
CP problem, 151-156 
CP violation 
in baryogenesis, 95 
in electroweak theory, 127-129 
critical density (p,), 6, 7 
critical temperature, 40 
in a GUT, 49 


D p-branes, 298, 300 

and supersymmetry, 300 
D-term, 138 

supergravity inflation, 232 
damping, 206 
dark energy, 11 
dark matter, 173—174 

cold, 175 

detection, 187-192 

supersymmetric, 174-192 
deceleration parameter, 4, 13 
decoupling, 24 
Density perturbations 

size of, 213 
density perturbations, 210 
deuterium bottleneck, 22 


DFSZ axion, 159-161, 169 
diffusion equations, 134 
dilaton, 249, 294, 300, 301 
as inflaton field, 259 
decay, 253 
Kahler potential, 256 
stabilization of, 255, 256 
domain wall, 66 
double images, 75 


e-folds, 200, 201, 204, 205, 219, 
232, 236 
Eddington-Finkelstein coordinate, 
277 
effective potential, 33 
finite-temperature, 34, 38, 41 
Einstein frame, 295 
Finstein-Max well equations, 282, 
293 
ekpyrotic universe, 272 
electroweak sphaleron rate, 
126-127 
energy conservation equation, 206 
energy density (o), 33, 147 
entropy 
black hole, 281, 284, 288, 303, 
305 
conservation of, 196 
entropy density (s), 31, 33, 148 
equation of state, 7 
equilibrium number density, 147 
Euclidean action, 121, 122 
event horizon, 277, 280, 283, 292, 
300, 301 
area, 278, 302 
extreme black holes, 291—293 
extreme Reissner-Nordstróm, 292 
extreme soliton, 299, 300 
extrinsic curvatire, 286 


F-flat, 138 

F-term, 138 
Fayet—Iliopoulos term, 233 
flat directions, 137-138 


flatness problem, 197, 200 

focus-point region, 185 

form fields, 294, 296-298 

free energy density (F), 31, 32 
of ultrarelativistic gas, 33 

freeze-out, 23, 24, 150 

frictional term, 203 

Friedman-Robertson- Walker 

(FRW), 6 
Friedmann equation, 6, 8 
funnel, 185 


gauge kinetic function, 57 
graceful exit, 201 
grand unified theory (GUT) 
magnetic monopoles, 85 
phase transition, 49 
gravitino problem, 177-179 
gravitinos 
density of, 237 
mass, 177 
mass of, 251 
thermal production of, 237 
GUT axion, 159 
GUT baryogenesis, 99-1 14 


hadronic axion, 159 
Hagedorn region, 263 
harmonic function, 292, 293, 299, 
302 

sum rule, 301 
helium abundance, 26 
hidden sector, 238 
Higgs model, 36 
Hodge dual, 296 
holographic principle, 288 
horizon, 210 
horizon problem, 195, 199 
hybrid inflation, 220 

in supergravity, 234 


imaginary time, 121 
inflation 
and axion abundance, 167 


Index 311 


chaotic, 217 

D-brane, 265 

e-folds of, 200, 201, 204, 205, 

219 

graceful exit from, 201 

hybrid, 220 

new, 201 

old, 199 

oscillatory period, 207 

reheating, 206, 231 

reheating temperature, 207 

slow roll, 201 

supergravity, 227 

thermal, 254 
inflationary potential 

energy scale of, 213 
inflaton, 202 

moduli field as, 259 
inflaton field, 206 

complex, 216 

fluctuations, 208 

operator, 208 
information problem, 288 
instanton, 152 


Kahler potential (K), 56, 233 
for moduli fields, 256 

Kalb-Ramond field, 294, 297 

Kerr-Newman metric, 282 
action, 287 

Kibble mechanism, 68 

Killing vector, 281 

kink soliton, 68 

Kruskal coordinates, 285 

KSVZ axion, 159, 161, 169 


light-cone coordinates, 280 
luminosity distance, 12 
luminous matter, 173 


magnetic flux, 72 

magnetic monopoles, 80 
abundance of, 86 
energy of, 81 


312 Index 


in GUTs, 85 

topological quantum number, 83 
matter domination, 7 

scale-factor time dependence, 8 

temperature time dependence, 19 
microstates, 289—291, 303-305 
misalignment, 165, 168 
moduli, 249 

as inflatons, 259 

Kahler potential, 256 
momentum, 302 
MSSM, 175, 179-181 
mSUGRA model, 181 


neutralino, 181 

annihilation, 183, 184 

elastic scattering, 188, 189 
neutrino decoupling, 150 
neutrino temperature, 150 
new inflation, 201 
Newman-Penrose equations, 279 
NS-NS 

electric charge, 297 
NS-NS sector, 294, 295 
nucleosynthesis 

starting time, 26 
null hypersurface, 277 
number of spatial dimensions, 264 


observable sector, 238 
old inflation, 199 


p-brane, 297 
electric charge, 297 
particle horizon (dy), 2-3, 86, 196, 
200 
partition function, 30, 31, 286, 287 
Peccei-Quinn symmetry, 154 
perturbative microstates, 289 
phase transitions 
first order, 43, 47, 51, 59 
in a GUT, 49 
in electroweak baryogenesis, 
129-132 


in electroweak theory, 45 
in the Higgs model, 36, 37, 41 
Planck mass, 6 
reduced, 6, 56 
Polonyi problem, 238 
Pontryagin index, 152 
power spectrum, 221 
PQ charge, 155 
pre-big-bang cosmology, 269 
preheating, 207 
pressure, 33, 147 


R-R 
electric charge, 297-298, 300 
R-R sector, 294, 295 
R-parity, 139, 179 
radiation domination, 7, 8 
scale-factor time dependence, 8 
temperature time dependence, 18 
transition from, 19-21 
recombination, 21 
redshift, 3, 4 
reheating, 206, 231 
reheating temperature, 207, 237 
Reissner-Nordstróm black hole, 
282, 300 
extreme, 292 
five-dimensional, 293 
relic abundance 
axions, 163-165 
relic neutrinos, 150-151, 172 
Robertson- Walker metric, 2-5 
running coupling constants, 54 


Sakharov criteria, 94—96, 127, 139 
scale-independent spectrum, 221 
Schwarzchild metric 

action, 287 
Schwarzchild solution, 285 
Scwarzchild solution, 276, 282 
slow roll, 201 

conditions for, 204 
Smarr’s formula, 283, 288 
SO(10) GUT, 110 


soliton, 298, 300 

extreme, 299, 300 
spatial dimensions 

number of, 264 
spectral index, 220, 221 
sphaleron, 124 

strong, 135 

washout condition, 131 
static metric, 281, 282 
stationary metric, 281 
stationary solution, 282 
string cosmology 

ten-dimensional, 260 
string frame, 295, 299 
SU (5) GUT, 101 
supergravity inflation, 227 

D-term, 232 
superpotential, 51 
supersymmetry algebra, 291 
supersymmetry breaking, 57 
supersymmetry charges, 300, 302 


Index 313 


surface gravity, 283, 286 
symmetric phase, 48, 49 


temperature discontinuities, 76 

thermal inflation, 254 

thermal production of gravitinos, 

237 

thermodynamics, 283 

6-term, 152, 154 

thin-wall approximation, 60, 61 

topological defects, 65 

tunnelling rate, 59, 62, 122 

type II supergravity, 293 
effective action, 294 


warp factor, 292, 300, 301 
warped metric, 299 
washout condition, 131 
WIMP, 175 

annihilation, 189, 192 


